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ABSTRACT 


^  The  Stylist  is  a  Waterloo  Pascal  program  that  analyzes  the  style  of  English  prose. 
A  "'style  checker’^The  Stylist  pertains  to  Computational  Stylistics  and  Computer 
Assisted  Composition  Instruction  (CACI).  The  Stylist  creates  an  affective  model  of 
the  text' based  upon  ‘the  following  characteristics  of  its  component  words  :  etymology, 
tangibility,  difficulty,  emotional  connotation  and  vigor.  The  Stylist  then  compares  this 
model  to  the  standards  of  fiction  or  nonfiction  texts  and  reports  results  and 
recommendations  to  the  user. 

The  Stylist  also  creates  a  concordance  of  the  user's  input  text  using  a  new  data 
structure  called  a  Concordance  Search  Tree  (CST).  A  CST  is  a  binary  search  tree  with 
a  linked  list  threaded  through  it  recording  the  order  of  the  use  of  each  word.  An 
inordcr  traversal  of  the  tree,  with  a  traversal  of  the  linked  list  during  each  visit,  creates 
a  concordance.  \ 

This  thesis  also  reviews  related  literature  and  programs.  (  /)£$&■  ?  '  , 
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I.  INTRODUCTION 


A.  BACKGROUND 

1.  Computational  Stylistics 

— *■  Computational  Stylistics  is  the  computer-assisted  study  of  literary  style, 
Computational  Linguistics  is  the  computer-assisted  study  of  language  itself.  These 
disciplines  are  the  automated  subsets  of  Statistical  (or  Quantitative)  Stylistics  and 
Linguistics,  which  are  the  mathematical  studies  of  style  and  language.  All  of  these 
disciplines  involve  quantifying  aspects  of  language  and  then  manipulating  these 
quantities  in  an  attempt  to  gain  insight  into  how  and  why  the  language  works.  ^ 

The  history  of  stylistics  can  be  traced  back  to  1851,  when  Augustus  de 
Morgan  suggested  that  word-length  could  prove  to  be  a  distinctive  trait  of  a  writer's 
style.  [Ref.  1]  This  suggestion  prompted  T.C.  Mendenhall,  an  American  geophysicist, 
to  investigate  whether  mean  word-length  could  resolve  authorship  problems  such  as 
those  posed  by  some  of  the  disputed  Shakespeare  plays  or  the  letters  written  under  the 
pen  name  of  Junius.  Working  in  the  late  19th  century,  Mendenhall  analyzed  the  word- 
lengths  of  some  two  million  words  from  various  periods  of  English  literature,  using  a 
primitive  tabulating  device  that  spit  out  reels  of  paper.  His  results,  however,  proved 
little. 

In  the  early  20th  century,  another  possible  characteristic  of  style,  the 
frequency  distribution  of  words,  came  under  wide-spread  investigation,  G.K.  Zipf 
postulated  a  "Rank- Frequency  Law",  by  which  a  ranking  of  the  use  of  words  in  a  text 
would  show  a  constant  decrease  from  the  most-us*d  word  down  to  the  least-used  word. 
[Ref,  2[  Other  scholars,  such  as  G.  U.  Yule,  investigated  such  aspects  as  the  richness  of 
a  writer's  vocabulary  and  the  length  of  sentences.  [Ref.  3:  pp.  363  -  390]  By  the  1950s 
and  1960s,  the  application  of  statistical  methods  to  the  study  of  literature  had  reached 
new  heights  of  sophistication  and  complexity,  Some  scholars  were  investigating  the 
significance  of  the  ratio  of  verbs  to  adjectives  and  others  were  applying  rigorous 
statistical  techniques.  Despite  the  many  fascinating  insights  offered  by  these  lines  of 
inquiry',  their  final  significance  and  scientific  credibility  remained  a  question  of  much 
dispute.  Enemies  of  the  new  disciplines  included  Norm  Chomsky,  the  pioneer  of 
formal  languages,  who  argued  that  writing  is  a  very  human  activity  which  involves  a 
great  deal  of  chance,  and  as  such  is  not  easily  quantified.  [Ref.  4] 


Computational  stylistics  began  as  a  natural  outgrowth  of  statistical  stylistics. 
With  the  spread  of  computer  access  some  thirty  years  ago,  some  scholars  quickly 
realized  the  potential  for  their  use  in  literary  studies; 

"I  first  heard  of  computers  in  1955  when  my  wife  (a  mathematician)  told  me  that 
she  was  going  to  work  for  an  oil  company  as  a  program  analyst ...  It  took  me  a 
while  to  understand  her  flowcharts  and  computer  programs,  but  when  I  did,  I 
realized  that  a  computer  could  be  used  to  solve  other  problems  as  well.  Since 
then,  I  have  used  a  computer  for  numerous  applications  relating  to  my  work  in 
the  English  Department  of  Cleveland  State  University,  and  have  taught  many 
others  ...  the  power  of  electronic  data  processing  in  the  study  of  language  and 
literature."  (Ref.  5] 

The  first  challenge  to  the  field  of  Computational  Stylistics  was  to  translate 
literary  texts  to  electronic  data.  Much  of  the  early  literature  of  the  field  is  devoted  to 
this  basic  problem.  After  the  first  two  decades,  corpora,  or  bodies  of  literary  texts  in 
machine  readable  form,  had  been  developed.  The  Lancaster-Oslo/Bergen  (LOB) 
Corpus  is  a  structured  collection  of  500  two-thousand-word  texts  of  written  British 
English.  An  American  version  of  this  effort  is  the  Brown  Corpus,  constructed  by 
Brown  University.1 

In  addition,  many  literary  classics  have  been  painstakingly  entered  by 
keyboard.  They  are  available  from  the  Oxford  Computing  Laboratory  at  Oxford 
University  and  the  American  Philological  Association.  [Ref.  5]  With  the  increasing 
capability  and  decreasing  cost  of  optical  readers,  the  problem  of  translating  literary  text 
to  electronic  data  should  become  trivial. 

The  usefulness  of  computers  to  statistical  stylistics  quickly  became 
indisputable.  Scholars  used  to  labor  to  copy  down  on  3"  X  5"  cards  each  occurrence  of 
every  noun  used  in  a  Shakespeare  play  and  then  sort  these  cards  by  hand.  [Ref.  6;  pp. 
33  -  50]  Now  a  computer  program  could  manipulate  the  text  in  any  number  of  ways 
within  minutes.  One  of  the  other  prominent  successes  of  the  use  of  computers  in 
literary  studies  was  the  automation  of  the  process  of  forming  concordances.  A 
concordance  is  an  alphabetical  listing  of  all  the  significant  words  used  in  a  text, 
together  with  the  enclosing  phrase.  Before  the  advent  of  computers,  scholars 
laboriously  built  concordances  of  the  Bible  and  the  plays  of  Shakespeare.  The  power 
of  the  computer  made  possible  giant  concordance-building  projects  such  as  one  for  the 

Electronic  forms  of  both  of  these  corpora  are  available  from:  The  Norwegian 
Computational  Center  for  Humanistic  Research,  P.O.  Box  53,  University  of  Bergen 
N-5014,  Bergen,  Norway 


Dead  Sea  Scrolls.  The  Centro  Atomazione  Analisi  Linguista  in  Gallarate,  Italy,  used 
computers  to  build  this  concordance,  which  helped  to  resolve  some  of  the  missing  or 
obliterated  words.  [Ref.  7j 

One  of  the  next  important  challenges  to  computational  linguistics  was  the 
issue  of  disambiguation.  Many  words  have  more  than  one  meaning  and  can  belong  to 
more  than  one  parts  of  speech.  An  example  in  English  is  the  word  "flies"  in  the 
following  two  sentences: 

Time  flies  like  an  arrow. 

Fruit  flies  like  an  apple. 


A  human  reader  can  easily  decide  that  "flies"  in  the  first  sentence  is  a  verb  and 
that  "flies"  in  the  second  sentence  is  a  noun.  This  decision  is  disambiguation.  In  order 
to  compute  such  characteristics  as  verb-noun  ratio,  it  is  first  necessary  to  disambiguate 
the  parts  of  speech.  Recent  advances  in  computational  linguistics  have  led  to 
programs  that  can  do  this  with  more  than  90%  success.  [Ref.  8]  [Ref.  9:  pp.  139  •  150[ 
[Ref.  10] 

Scholars  continue  to  argue  about  the  usefulness  of  statistical  stylistics,  Efforts 
to  discover  and  prove  laws  of  distribution  have  not  met  with  unchallenged  success, 
Nevertheless,  the  work  done  in  the  past  130  years  has  laid  some  groundwork  for  the 
use  of  computers  to  study  and  analyze  written  prose.  All  of  the  professorial  attempts 
thus  have  led  to  success  at  a  humbler  level  :  the  tutorial.  The  history  of  stylistics  is 
now  culminating  at  this  level,  in  the  form  of  Computer  Assisted  Composition 
Instruction. 

2.  Computer  Assisted  Composition  Instruction  (CACI) 

CACI  is  a  new  discipline  which  has  begun  to  emerge  in  the  past  decade,  The 
success  of  word  processing  programs  created  a  growing  population  of  people  who 
expected  computers  to  help  them  to  write.  Enterprises  such  Bell  Laboratories  and 
centers  of  learning  such  as  The  Pennsylvania  State  University  began  to  borrow  from 
the  field  of  stylistics  as  they  developed  computer  programs  that  would  help  students 
and  workers  plan,  write  and  edit  prose.  Several  universities,  such  as  Colorado  State 
University,  successfully  instituted  CACI  as  part  of  their  composition  curriculum. 
[Ref,  11]  The  earliest  CACI  programs  required  a  mainframe  or  at  least  a  minicomputer, 


With  the  expansion  of  the  power  of  personal  computers,  however,  software  houses 
such  as  DecisionWare  Inc.  began  to  write  CACI  programs  for  that  environment.  Over 
10,000  copies  of  DecisionWare's  RightWriter  are  now  in  use  at  corporations,  agencies 
and  universities  around  the  world,  according  to  their  advertisements.  Given  that  word 
processing  remains  the  single  most  common  use  of  personal  computers,  as  one 
megabyte  of  main  storage  becomes  commonplace  on  new  personal  computers,  style 
checkers  such  as  RightWriter  may  soon  become  as  popular  as  spell  checkers  have 
become  in  the  past  five  years. 

B.  SCOPE  OF  THE  STYLIST 

The  Stylist  is  a  style  checker  akin  to  Bell  Laboratory's  Writers  Workbench  and 
DecisionWare  Inc.'s  RightWriter.  It  does  not  help  the  user  to  plan  or  to  compose  his 
product.  Instead,  it  analyzes  the  finished  product  and  provides  that  analysis  to  the 
user  for  his  consideration  toward  revision.  The  Stylist  does  not  determine  parts  of 
speech,  as  does  Writer's  Workbench;  nor  does  it  suggest  alternatives  to  hackneyed 
phrases,  as  does  RightWriter.  What  the  Stylist  does  instead  is  to  analyze  some  of  the 
characteristics  of  the  words  used  in  the  text.  By  doing  so,  The  Stylist  can  distinguish 
between  short,  vigorous,  germanic,  emotional  texts  and  long,  lazy  latin  texts.  The 
success  of  this  effort  could  and  should  be  incorporated  into  more  extensive  programs 
such  as  Writer's  Workbench. 

In  addition,  The  Stylist  solves  the  traditional  problem  of  creating  a  concordance 
in  a  novel  and  elegant  way.  This  solution  is  optimal  for  the  personal  computer 
environment  of  the  coming  few  years. 
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II.  RESEARCH  FOR  THE  STYLIST 


A.  LITERATURE  REVIEW 

When  I  first  conceived  of  The  Stylist,  I  believed  that  a  "style  checker"  was  a 
completely  original  idea.  Little  did  I  know  that  major  universities  and  great 
coportations  had  been  working  on  the  problem  for  decades.  My  research  began  with 
the  Encyclopedia  Brittanica,  where  I  discovered  the  existance  of  the  fields  of  stylistics 
and  statistical  linguistics.  Using  these  as  subjects,  I  searched  the  Dewey  decimal 
system  and  several  automated  data  bases  for  titles.  The  books  1  discovered  referenced 
the  two  journals  which  publish  many  of  the  pertinent  articles  :  Computational 
Linguistics,  and  Computers  and  the  Humanities.  These,  in  turn,  carried  advertisements 
for  some  of  the  currently  available  software.  They  also  identified  the  current  centers  of 
learning,  some  of  which  I  contacted  for  guidance  and  information.  The  experience  of 
researching  the  field  demoted  me  from  its  inventor  to  its  lowliest  tyro.  It  also  subjected 
me  to  many  ideas,  some  of  which  I  incorporated  into  The  Stylist.  The  bibliography 
lists  the  sources  which  I  unearthed  which  may  benefit  other  students.  In  this 
discussion,  I  will  cover  only  those  articles  which  directly  influenced  the  development  of 
The  Stylist. 

I.  Affective  Tone 

C.W.  Anderson  and  G.E.  McMaster  reported  on  a  program  called  PSA 
(Psychological  Semantic  Analysis)  which  analyzes  the  emotional  tone  of  a  text. 
[Ref.  12J  They  built  upon  the  work  of  D.R.  Heise,  who  ascribed  values  of  "Evaluation, 
Activity  and  Potency"  to  the  thousand  most  commonly-used  words  in  the  English 
language.  [Ref.  13]  PSA  allows  the  user  to  enter  his  text  one  line  at  a  time;  if  any  word 
is  ambiguous,  PSA  asks  the  user  to  disambiguate.  PSA  then  matches  these  words  to 
the  1000  Heise-word  dictionary,  adding  values  up  to  create  a  profile  of  the  user's  text. 
Table  1  helps  clarify  the  meanings  of  the  three  Heise-word  categories. 

After  building  and  testing  PSA,  Anderson  and  McMaster  concluded  that: 
"The  affective  tone  of  whole  passages  can  be  measured  by  computer-assisted  collection 
of  .  .  ,  scores  of  those  words  ...  for  which  Heise  (1965)  has  provided  semantic 
meanings.''  [Ref.  12] 


TABLE  1 

HEISE  WORDS  VALUES 


EVALUATION 


High 

Neutral 

Low 

church 

God 

beauty 

pleasant 

experiment 

prove 

mountain 

suggest 

W 

fire 

disease 

bad 

ACTIVITY 

High 

Neutral 

Low 

fire 

great 

attack 

fight 

well 

fis? 

presence 

ftUr* 

sleep 

rock 

POTENCY 

High 

Neutral 

Low 

steel 

iron 

rock 

hard 

rich 

all 

indicate 

love 

kiss 

baby 

wife 

The  original  idea  of  The  Stylist  envisioned  doing  just  such  a  count  of  words 
chosen  for  their  frequency  of  use  and  their  characteristics  such  as  etymology,  emotional 
connotation  and  vigor.  Anderson  and  McMaster  convinced  me  that  this  method  had  a 
sound  psychological  base.  Moreoever,  Heise's  categories  of  Evaluation  and  Activity 
seemed  to  correspond  directly  to  my  envisioned  categories  of  emotional  connotation 
and  vigor.  His  category  of  Potency,  however,  struck  me  and  continues  to  strike  me  as 
distressingly  Freudian.  Because  The  Stylist  would  not  use  that  category  and  because  it 
would  use  others,  I  decided  not  to  borrow  Heise's  words  and  values,  but  rather  to  build 
my  own  dictionary  and  ascribe  my  own  values.  This  seemed  a  reasonable  approach, 
given  the  statement  by  Anderson  and  McMasters  that  "there  is  much  commonality  in 
the  emotional  response  of  different  persons  to  words  and  objects." 

2,  Readability  and  Sentence  and  Word  L  ength 

Two  of  the  most  common  stylistic  measures  are  sentence  length  and  word 
length.  The  most  common  use  of  these  measures  is  to  determine  readability.  The  basic 
idea  is  that  short  sentences  are  easy  to  read  and  long  sentences  are  hard  to  read. 
Various  formulae,  such  as  the  Kincaid,  the  Automated  Readability  Index  (ARI),  the 


Colcman-Eiau,  and  the  Flesch  Reading  Ease  Score,  (Ref,  10]  attempt  to  determine 
readability  as  a  straight-forward  function  of  sentence  and  word  length.  These  formulae 
have  hud  considerable  impact  on  the  teaching  of  English,  Most  style  checkers  use 
them.  Some,  like  RlghtVVritcr,  admonishes  the  writer  any  time  a  sentence  grows 
beyond  22  words  or  so.  Having  read  a  great  deal  of  Victorian  novels  and  Madison 
Avenue  copy,  I'm  well  aware  that  the  ideal  20th  century  sentence  is  short.  It's 
vigorous,  Easy  to  understand,  Maybe  even  dispenses  with  its  verb  as  it  rushes  toward 
its  punctuution!  The  mechanization  of  this  modern  tendency  into  simplistic  formulae, 
however,  scents  to  me  pernicious.  A  sentence  should  be  the  unit  of  a  complete  idea.  If 
we  limit  ourselves  to  short  sentences,  we  may  be  limiting  ourselves  to  small  ideas.  The 
averugc  sentence  of  the  prose  of  the  Age  of  Reason  was  45  words.  If  Samuel  Johnson 
were  to  live  today,  no  one  would  let  him  finish  a  sentence.  My  distrust  of  these 
formulae  redoubled  when  I  read  "Readability  is  a  Four-Ectter  Word,"  by  Jack  Seize r. 
[Ref.  \A\  In  this  article,  Sclzcr  forcefully  makes  the  points  that  readability  is  a 
subjective  quality  which  is  greatly  influenced  by  factors  such  as  arrangement  of  ideas, 
reader  background  and  interest,  and  difficulty  of  vocabulary.  The  Stylist  does  not  use 
readability  formulae.  It  does  ilug  sentences  which  seem  to  be  run-on.  More 
importantly,  it  guuges  tilt  dilTiculty  of  the  vocabulary.  If  the  writer  uses  hard  words, 
long  words,  latinato  words  and  long  sentences,  The  Stylist  warns  him,  particularly  if  the 
work  is  fiction, 

B.  PRODUCT  TESTING 
I.  RlghtWrlter 

As  part  of  my  research,  I  procured  a  copy  of  RightWritcr2  and  tested  it  with 
some  of  my  own  writing.  Overall,  the  program  impressed  me  with  its  capabilities  and 
its  engineering.  RightWritcr  reproduces  the  users  text  with  inserted  comments.  Its 
constant  challenge  of  long  or  complex  sentences  forced  me  to  reexamine  each  case  in 
particular.  Several  lengthy  sentences  became  two  short  ones,  It  never  let  me  begin  a 
sentence  with  “But,"  a  foible  of  mine.  It  applauded  my  writing  when  it  was  strong  and 
it  derided  it  when  it  was  pompous.  One  attractive  feature  was  the  production  of  a 
alphabetical  word  list  with  frequency  of  occurrence.  This  feature  demonstrated  my 
overfondness  for  the  words,  "ancient"  and  "thousand".  RightWritcr  also  suggested 

2 RightWritcr  (tin)  is  a  product  of  Decisionwarc  Inc.,  2033  Wood  St.,  Suite  218, 
Sarasota,  FL  33577,  (813)  952-9211. 


substitutes  for  hackneyed  or  useless  phrases  such  as  "the  fact  that".  Overall, 
RightVVriter  proved  itself  extensively  useftrl. 

2.  PC-Style 

PC-Styie3  is  a  much  less  ambitious  program  than  RightVVriter.  PC-Style 
required  only  40K  of  RAM,  compared  to  192K  for  Right  Writer.  PC-Style  is  also  much 
cheaper,  costing  only  S29.95  as  compared  to  RightWritcr's  S95.  I  tested  it  with  the 
same  test  data  set  I  had  used  on  Right  Writer.  PC-Style  has  a  nice  human-factor 
feature,  in  that  it  constantly  displays  to  the  user  how  much  it  has  done  and  how  fur  it 
is  from  finishing.  RightVVriter  is  more  a  colfce-brcak  program  :  you  execute  it  and 
then  you  go  make  some  codec.  PC-Style,  however,  had  little  to  recommend  itself 
besides  this  feature.  It  relies  upon  a  readability  formula.  It  also  attempts  some 
affective  modeling,  based  upon  its  dictionary  of  50  action  verbs.  This  miniscule 
dictionary  is  inadequate  for  the  task.  A  match  of  less  than  two  percent  of  my  input 
text  with  these  50  words  was  typical;  such  a  small  sample  is  inadequate  to  qualify  the 
vigor  of  a  passage.  Another  simplistic  but  more  valuable  tactic  of  PC-Style  was  to 
count  the  frequency  of  “personal"  words,  such  as  “1”,  “you"  and  “we“,  Although  the 
conventions  of  the  more  study  forms  of  writing  forbid  them,  it  is  generally  accepted 
that  most  technical  writing  benefits  from  direct,  personal  pronouns.  PC-Style 
reinforces  the  clarity  and  forccfUlncss  of  direct  rhetoric.  Despite  these  few  nice 
features,  PC-Style  is  a  too  simplistic  to  be  of  lasting  utility. 

3.  Writer's  Workbench 

Although  l  was  unable  to  experience  Writer’s  Workbench,  1  did  obtain 
enough  research  materials  to  form  an  impression  of  its  utility.  (Ref.  10)  Writer  s 
Workbench4  is  actually  a  complex  of  32  programs.  Together,  these  programs  provide 
more  than  all  the  features  of  RightVVriter.  STYLE  calculates  readability,  using  the 
previously-discussed  formulae.  It  also  analyzes  sentence  type  (simple,  complex  or 
compound).  STYLE  is  able  to  disambiguate  the  words  of  the  input  text  with  95% 
accurucy.  It  then  analyzes  the  use  of  verbs  nnd  modifiers.  If  a  passage  relies  too 
heavily  on  the  passive  voice  or  it  is  fat  with  modifiers,  STYLE  warns  the  user.  The 
follow-on  program,  DICTION,  detects  hackneyed  phrases.  SUGGEST  suggests 
replacements.  Overall,  Writer’s  Workbench  appears  to  be  the  industry  standard. 


3PC-Stylc  (tm)  is  a  product  of  ButtonWare  Inc,,  P.O.  Box  57S6,  Bellevue,  WA 
98006,  (206)  454-0479. 

^Writer's  Workbench  (tm)  is  a  product  of  Bell  Laboratories,  Murray  Hill,  New 
Jersy,  07974. 


C.  CONCLUSIONS  BASED  UPON  RESEARCH 

Now  I  knew  that  I  would  not  be  writing  the  first  style  checker.  Nor  would  I  be 
writing  the  second,  third,  fourth  or  fifth.  Some  solace  could  be  found  in  the  fact  that 
the  previous  programs  relieved  me  of  the  need  to  incorporate  all  stylistic  features  into 
The  Stylist.  I  could  extend  the  affective  modeling  of  PSA  by  including  other  values  of 
words.  Three  of  these  new  values  could  be  more  objective-etymology,  difficulty  and 
tangibility.  Together  with  vigor  and  emotional  connotation,  these  values  should  be 
able  to  describe  a  profile  of  the  user’s  input  text. 

Additionally,  I  could  increase  the  accuracy  of  such  style  checkers  by  pointing  out 
the  need  to  make  allowances  for  the  genre  of  the  user's  input  text.  Any  reader  of 
technical  as  well  as  fictional  writing  knows  that  the  characteristics  of  these  two  types 
often  vary  more  widely  than  the  style  of  writers  within  the  type.  Comparative  style 
analysis,  therefore,  would  clearly  seem  to  need  to  take  the  genre  into  account, 


III.  DESIGN  OF  THE  STYLIST 


A.  CHARACTERISTICS  OF  WORDS 

The  fundamental  idea  of  The  Stylist  is  that  individual  words  have  power.  The 
denotation  of  a  word  is  its  meaning.  The  connotation  of  a  word  is  its  emotional 
impact.  For  example,  "pupil",  "student",  and  "scholar"  all  denote  a  person  who  studies 
and  learns.  The  emotional  connotations,  however,  range  from  the  humble  "pupil" 
through  the  familiar  "student"  to  the  lofry  “scholar".  Besides  these  emotional 
connotations,  words  have  other  intrinsic  values  which  can  be  quantified.  The  Stylist 
would  concentrate  on  the  following  values:  Etymology,  Tangibility,  Difficulty, 
Emotional  Connotation  and  Vigor, 

1.  Etymology 

One  of  the  beauties  of  the  English  language  is  that  its  vocabulary  embraces 
two  main  sources:  native  and  borrowed,  Our  native  words  mainly  come  to  us  from  the 
Anglo-Saxon  tongues.  Borrowed  words  come  from  Latin,  mostly  by  way  of  the  French 
of  the  Norman  conquest.  The  following  tabic  Illustrates  the  differences : 


TABLE  2 

EXAMPLES  OF  NATIVE  AND  BORROWED  WORDS 

Native  Word* 

Borrowed  Word* 

Man 

Person 

Finger 

Thinker 

Digit 

Philosopher 

Fire 

Conflagration 

Book 

Volume 

Dirtbag 

Miscreant 

Native  words  are  short,  strong  and  rough.  Borrowed  words  tend  toward 
length,  gentility  and  elegance.  Good  English  prose  (particularly  good  fiction)  favors 
native  words.  Poor  English  prose  (particularly  bad  technical  writing)  exhibits  a 
tendency  to  overutilize  latinate  etymology.  By  counting  up  the  number  of  native  and 
borrowed  words  in  a  user's  input  text,  The  Stylist  could  see  how  it  compares  to  good 
writing  of  the  appropriate  genre. 
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2.  Tangibility 

A  word  can  cither  evoke  an  image  of  a  thing  or  it  can  refer  to  an  idea.  I  call 
the  former,  "tangible"  and  the  latter,  "intangible".  The  following  table  illustrates  the 
difference: 


TABLE  3 

EXAMPLES  OF  TANGIBLE  AND  INTANGIBLE  WORDS 

tangible; 

INTANGIBLE 

Rock 

Lip* , 

Baahlva 

Corvetta 

Hammar 

Etharaal 

Automatically 

Than 

Preliminary 

Tangible  words  are  concrete,  exact  and  evocative.  Intangible  words  are  ideal, 
general  and  cognitive.  All  writing  uses  both.  Good  writing  usually  takes  advantage  of 
tangible  words.  Even  the  most  philosophical  writing  benefits  from  the  use  of  tangible 
words.  (See  Appendix  E  for  an  analysis  of  a  Platonic  dialogue).  Good  fiction  writing 
rurely  strays  too  for  toward  the  intangible,  The  Stylist  could  count  the  tangible  and 
intangible  words  of  the  user's  input  text  and  compare  this  count  to  good  writing  of  the 
appropriate  genre. 

3.  Difficulty 

This  is  one  of  the  most  indisputable  characteristics  of  words.  For  the 
purposes  of  The  Stylist,  I  defined  four  levels  of  difficulty  :  Elementary,  High  School, 
Graduate  and  Postgraduate.  The  words  of  each  level  are  those  most  likely  to  be  used 
in  speech  with  case  by  an  average  person  of  that  educational  level,  Although  I'm 
aware  that  reading,  writing  and  speaking  vocabularies  are  different,  1  contend  that  a 
person  reading  a  word  within  his  reading  vocabulary  but  outside  of  his  speuking 
vocabulary  often  must  pause  for  a  mental  translation  into  more  simple  terms.  The  act 
of  this  pausing  raises  the  difficulty  level  of  the  text.  Table  4  illustrates  the  categories. 
These  examples  are  based  upon  my  judgements. 

By  counting  the  occurrence  of  difficult  words,  The  Stylist  could  determine  the 
overall  difficulty  of  the  vocabulary  and  thus  the  readibility  of  the  text. 


TABLE  4 

EXAMPLES  OF  WORDS  OF  VARIOUS  DIFFICULTY 


ELEMENTARY 

Big 

Slater 

Spoon 

Flying 

Wish 

Done 

Handsome 


HIGH  SCHOOL 

Tardy 

Rendezvous 

Transmission 

Foremen 

Process 

Joyous 

Undergo 


GRADUATE 

Matrix 

Immaterial 

Tonality 

Universal 

Induction 

Processor 

Linear 


POSTGRADUATE 

Execrable 

Parsimony 

Recursive 

Homomorph 

Dilatory 

Zygote 

synergy 


4.  Emotional  Connotations 

The  Stylist  would  have  five  categories  of  emotional  connotation  :  Sublime, 
Pleasant,  Neutral,  Unpleasant  and  Horrid.  The  following  table  illustrates  the 
categories: 


TABLE  5 

EXAMPLES  OF  WORDS  OF  VARIOUS  EMOTIONAL  CONNOTATIONS 


SUBLIME 

PLEASANT 

NEUTRAL 

UNPLEASANT 

HORRID 

Baauty 

Sunriia 

Victory 

Lova 

God  , 

Par adi •• 

BSa. 

Sax 

fe 

Xa 

Nama 

Whan 

Damaga 

Loss 

Insulting 

Losar 

Cost , 

Wound 

Cancar 

Murdar 

Whora 

Fuck 

By  counting  the  use  of  these  types  of  words,  The  Stylist  could  determine  the  overall 
emotional  tone  of  the  passage.  It  could  also  detect  flat  or  emotional  writing. 

5.  Vigor 

A  related  but  distinct  characteristic  is  vigor.  Sublime  and  horrid  words  tend  to 
be  highly  vigorous,  but  not  all  highly  vigorous  words  arc  emotional.  Examples  of 
vigorous  but  unemotional  words  are  "sprint",  "rush"  and  "cross".  This  category  also 
tends  to  be  more  objective.  For  example,  the  word,  "soldier"  can  have  widely  different 
connotations  for  different  people.  The  word  "soldier"  would  please  a  career  Army 
officer  but  it  would  displease  a  survivor  of  the  Japanese  occupation  of  Canton,  Both 
could  agree  that  "solider"  is  a  vigorous  word. 


TABLE  6 

EXAMPLES  OF  WORDS  OF  VARIOUS 

VIGOR 

VIOLENT 

ENERGETIC 

CALM 

INERT 

Destroy 

Creation 

F  ire, 

Atomic 

Holocauat 

Conqueror 

Sprint 

Dive,  . , 

Steam 

Flying 

Wr?te 

Manager 

Ocean 

Blue 

Sought 

From 

Into 

Something 

Comma 

Paper 

Format 

By  calculating  the  vigor  of  the  words  of  the  user's  input  text,  The  Stylist 
would  be  able  to  estimate  its  overall  strength. 

B.  THE  NEED  FOR  A  DICTIONARY 

To  quantify  the  above  five  aspects  of  the  words  of  the  user's  Input  text,  It  would 
be  necessary  to  maintain  a  dictionary  of  the  most  commonly  used  words  and  their 
values.  The  original  idea  of  The  Stylist  forsaw  just  such  a  dictionary.  After  reading 
Anderson  [Ref.  12]  and  testing  PC*Style,  the  idea  seemed  less  original  but  still  valid. 
The  heart  of  The  Stylist,  therefore,  would  be  Its  dictionary.  Before  I  began  any  top- 
down  designing,  1  first  wanted  to  explore  the  technical  challenges  and  possible  pitfalls 
of  building  such  a  dictionary.  A  review  of  data  structure  literature  convinced  me  that 
two  approaches  were  the  most  feasible:  hashing  and  Binary  Search  Tree,  Since  I  had 
already  decided  that  The  Stylist  would  also  produce  a  concordance,  I  had  to  take  into 
account  the  need  for  efficient  alphabetical  traversal  of  the  words  of  the  user's  input 
text.  My  intuition  that  hashing  would  be  preferable  for  dlcticnaty  look-up  but  would 
not  lend  itself  easily  to  concordance-building  was  confirmed  by  a  pussage  in  an 
excellent  text  by  Donald  E.  Knuth,  [Ref.  15:  p,  540]  Having  decided  to  implement  the 
dictionary  with  a  Binary  Search  Tree  (BST),  I  began  to  build  a  series  of  prototypes. 

C.  THE  COPYDIX  PROTOTYPES 

CopyDixl  through  CopyDixd  were  early  experiments  in  using  a  Binary  Search 
Trees  for  dictionary  lookup,  Naturally,  the  word  was  the  key  value.  The  experience  of 
the  CopyDix  series  taught  me  the  following  lessons: 
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1.  Tha  Benefit  of  Preorder  Storage 

The  structure  of  a  Binary  Search  Tree  is  maintained  during  execution  as  a 
system  of  pointers.  These  pointers  refer  to  locations  in  memory  which  pertain  only  to 
that  execution.  If  a  dictionary  is  modified  during  execution  (for  example,  if  new  words 
are  added),  then  the  BST  should  be  stored  to  its  file  in  preorder.  An  inorder  storage 
seems  the  most  logical  way  but  it  is  actually  the  worst.  If  a  BST  is  stored  using  an 
inorder  traversal,  the  next  time  it  is  loaded  it  will  be  that  least  bushy  of  all  BSTs:  a 
linked  list.  Storage  using  preorder  will  cause  the  next  loading  of  the  tree  to  duplicate 
the  last. 

2.  The  Need  for  AVL 

Even  with  preorder  storage,  the  dictionary  needed  the  capability  to  change;  to 
grow  or  shrink  as  the  user  desired,  Any  particular  office  or  curriculum  has  its  own 
special  vocabulary.  1  wanted  to  give  The  Stylist  the  ability  to  adapt  its  dictionary  to 
the  vocabulary  of  its  user's  environment.  Without  some  mechanism  for  rebalancing 
the  BST  after  the  Insertion  of  new  words  or  the  deletion  of  unwanted  words,  the  BST 
could  become  increasingly  lopsided.  Searching  for  words  in  such  a  lopsided  tree  would 
become  inefficient,  Clearly,  an  AVL  scheme  was  required.  I  adapted  an  AVL  insertion 
routine  ftom  an  excellent  text  by  Nlkkus  Wirth.  IRef  16:  pps.  220  *  221) 

3.  Storage  requirements 

The  CopyDix  series  allowed  me  to  see  whether  a  BST  of  5000  words  and 
associated  values  would  fit  into  the  main  storage  available  to  a  healthy  personal 
computer.  I  determined  that  1.5  megabytes  would  be  sufficient  for  The  Stylist  and  its 
data  structures.  The  Copydix  series  allowed  me  to  create  such  a  structure  and  to  prove 
that  it  would  not  require  more  than  one  megabyte. 

4.  Text  Processing 

The  CopyDix  series  also  identified  some  unforscen  complexities  in  processing 
text,  For  example,  in  order  to  include  contractions,  it  was  necessary  to  include  the 
single  quotetion  mark  (')  in  the  dictionary,  Waterloo  Pasca)  uses  this  character  to 
define  the  beginning  and  end  of  user-defined  constants  and  strings,  To  include 
contractions,  it  was  necessary  to  declare  "Succ('@')"  instead  of  (')!  There  were  a  few 
other  similar  problems  requiring  equally  unhappy  solutions. 

5.  Building  a  Dictionary 

Finally,  the  CopyDlx  series  allowed  me  to  begin  to  build  a  dictionary  of  most 
frequently  used  words,  I  ran  the  text  of  two  novels  and  several  technical  articles 
through  the  Copydix  series,  accumulating  a  dictionary  of  over  2000  words, 


D,  CONCORDANCE  SEARCH  TREE  (CST) 

1.  Original  Concept 

Based  upon  the  experience  of  the  CopyDix  series,  I  began  to  play  around  with 
the  various  ways  of  building  a  concordance.  The  literature  provided  two  examples  of 
how  the  problem  has  already  been  solved.  One  solution  was  to  create  a  search  tree. 
The  key  value  of  the  search  tree  was  the  word;  two  associated  tables,  TOKEN  and 
TYPE,  recorded  the  information  about  the  sequence  of  occurrence  of  the  words. 
(Ref.  ?:  pp,  186  *  191]  Another  solution  familiar  to  most  computer  scientists  was  the 
Key  Words  in  Context  (KWIC)  program  designed  by  Parnas,  [Ref.  17]  This  solution 
creates  a  KWIC  listing  by  circular  shifting  each  line  and  then  sorting  each  line;  such  a 
KWIC  listing  is  similar  to  a  concordance, 

I  wanted  to  create  a  concordance  without  relying  upon  any  external  storage. 
My  experience  with  the  CopyDix  series  had  shown  that  text  processing  is  both  I/O  and 
computation  bound;  I/O  bound  because  large  text  files  must  be  loaded;  computation 
bound  because  each  word  must  be  processed  character  by  character.  I  wanted  the 
solution  to  eliminate  the  need  for  further  input  or  output  (such  as  using  secondary 
storage  to  build  files  containing  KWIC  lines).  I  also  wanted  to  minimize  computation 
by  allowing  The  Stylist  to  remember  the  order  of  occurrence  of  each  word  of  the  input 
text,  without  having  to  recompute  it.  I  wanted  my  solution  to  tlow  as  naturally  as 
possible  from  the  process  of  looking  each  word  up  in  the  dictionary.  I  didn't  want  to 
have  to  process  each  word  twice,  or  to  search  for  the  position  of  any  word  twice. 
Ideally,  the  Stylist  would  have  enough  main  storage  available  that  it  could  retain  all 
important  information,  without  resorting  to  recomputation  or  to  secondary  storage. 
These  ideas  led  me  play  to  around  with  various  ways  of  using  a  BST  to  record  the 
order  of  occurrence  of  the  words  of  the  user's  input  text.  One  obvious  solution  would 
be  to  have  each  node  of  the  BST  contain  pointers  to  associated  data  structures  such  as 
an  array  of  linked  lists  containing  the  sentences  of  the  input  text.  Such  a  solution, 
however,  would  require  storing  the  words  twice.  CopyDix  had  shown  that  words  take 
up  a  great  deal  of  storage  space.  Storing  them  twice  would  be  wastellil.  With  these 
ideas  in  mind,  I  lit  upon  the  idea  of  the  Concordance  Search  Tree  (CST). 

A  CST  is  a  Binary  Search  Tree  with  a  linked  list  threaded  through  it.  This 
linked  list  is  two-way,  connecting  each  word  of  the  BST  to  the  word  used  before  it  and 
to  the  word  used  after  it.  An  inorder  traversal  of  the  BST  visits  each  word  in 
alphabetical  order.  During  each  visit,  a  traversal  of  the  linked  list  in  the  "backwards" 


direction  would  encounter  all  the  words  used  before  that  word.  Printing  these  words, 
then  printing  the  hey  word  being  visited,  and  finally  printing  the  words  in  the 
"forwards"  direction  creates  a  concordance. 

This  solution  offers  some  exciting  facilities.  It  allows  the  user's  input  text  to 
be  accessed  in  any  number  of  ways.  If  a  concordance  of  three,  five,  seven  or  thirteen 
Vvords  per  line  is  desired,  changing  a  few  global  constants  immediately  fulfills  that 
desire.  If  only  the  sentences  containing  a  certain  word  are  desired,  a  insert  search  of 
the  CST  keying  on  that  word,  and  then  a  concordance  traversal  on  that  node,  provides 
all  those  sentences.  In  short,  a  CST  is  a  complex  structure  with  a  challenging  but 
strong  intuitive  appeal.  It  provides  a  great  deal  of  flexibility  to  the  processing  of  text. 

One  added  wrinkle  of  complexity  lies  in  the  fact  that  many  words  of  the  input 
text  occur  more  than  once.  The  word,  "the",  ror  example,  occurs  many  times.  To 
guarantee  a  correct  concordance  traversal  of  the  CST,  each  occurrence  of  the  word 
"the"  must  be  associated  with  a  unique  pointer.  This  requirement  is  satisfied  by 
creating  a  unique  node  of  pointers  associated  with  each  occurrence  of  the  word.  The 
,  main  node  of  the  CST  contains  the  word,  its  values,  and  left  and  right  pointers.  It  also 
contains  a  pointer  called  "down",  which  points  to  the  linked  list  of  unique  nodes  of 
pointers.  The  first  such  node  pertains  to  the  first  occurrence  of  the  word,  "the",  in  the 
user's  input  text.  It  points  to  the  word  used  before  and  the  word  used  afier  that 
occurrence  of  the  word  "the".  (These  pointers  arc  called  "last"  and  "next".)  It  also 
points  "down"  to  the  second  occurrence  of  the  word  "the".  So  the  Stylist  creates  a 
concordance  through  the  following  traversals:  an  inorder  traversal  of  the  CST  visits 
each  main  node  in  alphabetical  order.  During  that  visit,  The  Stylist  goes  down  the 
linked  list  of  the  occurrences  of  that  word.  During  each  step  down  the  linked  list,  The 
Stylist  traverses  Tustward"  and  then  "nextward"  to  print  out  the  line  of  that 
occurrence.  When  The  Stylist  reaches  the  bottom  of  the  downward  list,  it  continues 
the  inorder  traversal. 

2.  The  Tril  -  TH9  Prototypes 

To  examine  the  feasibility  of  the  CST  concept,  I  wrote  a  series  of  progressively 
more  capable  prototypes  called  Tril  through  Tri9.  (The  names  refer  to  a  "Trinary 
Search  Tree".)  I  found  that  I  could  implement  all  the  procedures  necessary  to  build 
and  traverse  a  CST  in  only  137  lines  of  code.  Moreover,  the  performance  of  the 
concept,  both  in  terms  of  main  storage  and  processor  requirements,  seemed  extremely 
satisfactory. 


E.  DESIGN  OF  THE  STYLIST 

Having  used  a  series  of  prototypes  to  identify  and  solve  the  critical  issues,  I  set 
them  aside  and  began  to  design  The  Stylist.  After  jotting  down  some  logic  flowcharts 
and  data  flow  diagrams,  I  decomposed  the  problem  into  three  main  modules  wrhlch  I 
called  "Reader-,  "Researcher"  and  "Reporter". 

1.  The  Reader  Module 

Only  Reader  has  access  to  the  user's  input  text.  It  reads  that  text  and  passes 
along  a  data  structure  called  "rcadnode",  which  is  merely  the  word  and  its  length.  The 
Reader  also  signals  the  ends  of  sentences  and  the  end  of  the  file. 

2.  The  Researcher  Module 

Researcher  receives  the  "readnodes",  researches  the  qualities  of  that  word, 
builds  a  profile  of  the  user's  input  text  based  upon  the  tally  of  the  qualities  of  those 
words,  and  then  passes  that  profile  on  to  the  Reporter  Module.  If  the  user  wants  one, 
Researcher  also  creates  the  concordance.  Only  Researcher  has  access  to  the  dictionary 
file  and  to  the  CST.  Neither  Reader  nor  Reporter  know  how  the  dictionary  is 
implemented  or  how  the  concordance  is  produced.  This  is  the  most  crucial  instance  of 
information  hiding  in  the  design, 

3.  The  Reporter  Module 

Reporter  receives  the  profile,  which  it  compares  to  tho  profiles  of  other  texts. 
Based  upon  this  comparison,  it  passes  on  recommendations  and  commendations  to  the 


4.  Low  Level  Design 

Having  decomposed  the  problem  into  these  three  modules,  I  jotted  down  a 
plan  for  the  procedures  that  would  comprise  them.  This  plan  included  the 
input/output  and  operations  of  each  procedure.  This  low  level  design  included  all  the 
parameters  of  the  procedures  and  identified  the  hierarchy  between  the  procedures  and 
modules. 

One  of  the  important  features  built  into  this  low  level  design  was  the  human- 
factor  facility  of  reporting  to  the  user  the  progress  of  the  execution  of  The  Stylist.  The 
need  to  send  messages  to  the  screen  telling  the  user  what  was  happening  required 
counting  steps  and  further  input/output,  but  my  experiences  with  RightWritcr,  PC- 
Style  and  the  CopyDix  prototypes  had  convinced  me  that  the  expense  was  worth  it. 
Psychologically,  staring  at  a  blank  screen  for  five  minutes  seems  ten  times  as  long  as 
reading  a  dynamic  screen  for  seven  minutes.  To  provide  the  user  the  option  of  quicker 
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but  more  boring  execution  over  slower  but  more  engaging  execution,  I  designed  a 
facility  that  asked  him  whether  he  wanted  frequent  reports,  occasional  reports  or  one 
report  when  done. 

Witn  this  low  level  design  in  hand,  I  was  ready  to  begin  to  code. 


IV.  IMPLEMENTATION  OF  THE  STVLIST 


A.  CODING 

Because  of  the  experience  gained  from  the  CopyDix  prototypes  and  because  of 
the  top-down,  modular  design,  coding  the  Reader  and  Researcher  modules  took  only 
two  weeks.  Prototyping  and  top-down  design  allowed  me  to  code  quickly  and  with  few 
errors.  (The  usefulness  of  these  software  engineering  techniques  would  also  prove  itself 
in  the  testing  phase.)  As  I  had  the  CopyDix  series,  I  implemented  The  Stylist  in  a 
series  of  iterative  steps  called  Stylel  through  Style9, 

Style6  brought  to  light  light  an  unforseen  problem:  the  interaction  between  the 
facilities  of  adding  new  words  to  the  dictionary  and  creating  a  concordance.  The 
design  called  for  allowing  the  user  to  decide  whether  he  wanted  to  add  new  words  to 
the  dictionary  and  whether  he  wanted  a  concordance.  These  two  choices  created  four 
cases:  Growing  Dictionary  and  Concordance;  Growing  Dictionary  and  no 
Concordance;  Static  Dictionary  and  Concordance;  Static  Dictionary  and  no 
Concordance.  In  the  case  of  Static  Dictionary  and  Concordance,  to  create  a 
concordance,  new  words  found  in  the  user's  input  text  would  have  to  be  added  to  the 
CST.  But  since  the  user  had  chosen  not  to  add  new  words  to  the  dictionary,  The 
Stylist  could  not  prompt  him  for  the  values  of  these  new  words..  This  was  not  a 
problem  until  the  same  new  word  was  used  again.  Researcher  would  find  this  new 
word  in  the  CST  but  would  not  find  any  values  for  the  word.  This  would  cause  a  fatal 
error:  "Word.etymology  has  unassigned  value,"  To  avoid  this  problem,  it  was 
necessary  to  add  a  new  field  to  each  of  the  nodes:  Status.  The  status  of  a  node  could 
be:  "Valuable",  which  meant  that  the  node  contained  values  for  the  other  fields; 
'' No  t_  Valuable",  which  meant  it  didn't, 

This  particular  problem  grew  even  more  gnarly  when  I  decided  to  extend  the 
Look_Up  procedure  of  Researcher.  The  orginal  design  called  for  Look_Up  to  try  to 
match  the  input  word  only  with  its  exact  equal  in  the  dictionary.  This  worked  fine,  but 
it  meant  that  the  dictionary  would  have  to  contain  all  the  following  variants  of  "look"  ; 
"look",  "looks",  "looked",  and  "looking".  The  extension  created  a  new  procedure  called 
Look_Up_ Variants.  If  the  input  word  didn't  match  any  word  in  the  dictionary, 
Researcher  looked  up  variants  of  the  word  If  it  found  one,  the  values  of  the  variant 


were  added  to  the  profile.  This  extension  allowed  a  more  compact  dictionary  and  it 
increased  th$  probability  that  most  words  would  be  found,  but  it  complicated  the 
problem  described  in  the  previous  paragraph.  In  the  case  of  Growing  Dictionary  and 
Concordance,  a  word  like  "looked"  would  have  to  be  added  to  the  CST  for  the  sake  of 
a  concordance,  but  now  it  would  also  be  added  to  the  dictionary.  Look_up_variants 
would  thus  eventually  gum  up  the  dictionary  with  a  lot  of  useless  variants.  Solving  the 
problem  meant  adding  a  third  status,  The  three  statuses  were  now  :  "Storable",  which 
meant  the  node  was  a  first  class  citizen,  a  word  which  had  values  and  should  be  stored 
to  the  dictionary;  "Not_Storablc",  which  meant  that  the  node  was  a  second  class 
citizen,  a  variant  word  which  had  values,  but  should  not  be  stored  to  the  dictionary; 
and  "Not_Valuable",  which  meant  the  word  was  a  third  class,  citizen,  a  new  word  which 
had  no  values,  nor  should  be  stored  to  the  dictionary. 

The  solution  of  these  problems  completed  the  coding  phaso  of  the  Reader  and 
Researcher  modules.  The  Reporter  module  remained  a  stub.  The  Stylist  could  not 
make  recommendations  and  commendations  to  the  user  until  it  had  established  its 
dictionary  and  built  up  a  history  of  other  profiles. 

B.  BUILDING  THE  DICTIONARY 

The  CopyDix  series  and  Style  l  through  Style7  had  built  up  a  dictionary  of  some 
3000  words,  which  contained  about  80%  or  85%  of  the  words  of  any  given  text.  Each 
of  these  3000  words,  however,  now  required  the  assignation  of  five  values :  etymology, 
tangibility,  difficulty,  emotional  connotation  ,and  vigor.  That  totals  to  15,000  values. 
After  experimenting,  I  developed  a  system  that  allowed  me  to  enter  those  15.000  values 
in  two  weeks.  First,  1  sorted  the  dictionary  alphabetically  and  printed  out  a  listing. 
Using  Funk  and  Wagnall's  Collegiate  dictionary,  I  checked  the  etymology  and 
highlighted  the  borrowed  words  on  the  print-out.  Then  I  wrote  a  program  called 
AUTOMATE,  which  displayed  a  screen  for  each  of  the  five  values  of  each  word  and 
required  one  keystroke  to  assign  a  value.  Using  this  system,  I  assigned  values  to  all 
3000  words,  going  at  a  speed  that  relied  upon  a  subjective  reaction  similar  to  the 
reaction  of  a  reader  scanning  a  text  at  a  comfortable  reading  pace.  Then  1  wrote  two 
new  programs,  DIXSPLIT  and  DIXJOIN.  DIXSPLKT  split  the  dictionary  into  17 
intersecting  subsets  :  all  native  words,  all  borrowed  words,  all  tangible  words,  all 
intangible  words,  and  so  on.  Each  of  these  subsets  was  like  a  formation  of  soldiers. 
Any  word  that  didn't  belong  in  that  subset  stuck  out  conspicuously.  After  correcting 
these  mistakes,  I  executed  DIXJOIN,  which  brought  the  amended  dictionary  together. 

Now  I  was  ready  to  begin  testing  The  Stylist. 
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V.  TESTING  OF  THE  STYLIST 


A.  INITITAL  TEST  DATA  SET 

The  initial  test  data  set  for  The  Stylist  included  both  fiction  and  technical  writing. 
Fiction  was  represented  by  long  passages  from  two  of  my  novels  and  1000-word 
excerpts  from  the  novels  of  Ernest  Hemingway,  Raymond  Chandler  and  Kurt 
Vonnegut.  Technical  writing  was  represented  by  about  a  dozen  student  essays  and 
excerpts  from  three  textbooks  on  computer  science.  In  both  categories,  I  included 
both  good  and  poor  writing.  I  ran  all  the  texts  through  Styles,  the  Reporter  module  of 
which  merely  printed  out  the  values  of  their  profiles.  Style8  performed  robustly.  I 
executed  it  under  all  four  cases  pertaining  to  the  dictionary  and  concordance  as  well  as 
under  all  three  cases  pertaining  to  the  frequency  of  execution  status  reports.  Style8 
revealed  no  major  flaws.  Its  shortcomings  were  well  within  the  scope  of  the  original 
design.  (See  Appendix  A,  Suggested  Extensions  to  The  Stylist).  Style8  suggested  some 
fme-tuning  changes. 

Two  changes  dealt  with  the  part  of  the  CST  scheme  called  the  FourDix.  The 
FourDix  contains  the  most  common  function  words,  all  of  which  are  four  letters  or  less 
in  length.  Examples  of  FourDix  words  are  'the",  "a",  "then",  "or",  "he",  "she",  and  so 
on.  The  FourDix  is  a  separate  CST.  Words  of  the  user's  input  text  of  four  or  less 
letters  in  length  arc  first  sought  for  in  the  FourDix.  This  is  an  economy  measure,  since 
a  large  portion  of  any  text  is  made  up  of  the  function  words  found  in  the  FourDix. 
Another  savings  of  the  FourDix  is  that  the  concordance  is  created  by  an  inorder 
traversal  of  the  main  CST.  This  means  that  the  Amctional  words  of  the  FourDix  are 
included  in  the  concordance  only  in  the  phrases  embracing  the  more  significant  words. 
Otherwise,  the  concordance  would  have  dozens  or  perhaps  hundreds  of  lines  showing 
each  use  of  words  like  "the".  The  first  fine-tuning  change  suggested  by  Style 8  dealt 
with  the  values  of  the  words  of  the  FourDix.  These  words  had  values,  of  course,  but 
they  tended  to  obscure  the  overall  picture.  Eliminating  their  values  from  the  tally  was 
the  first  solution.  I  modified  this  to  adding  only  those  values  of  personal  prom  uns, 
which  are  words  of  energetic  vigor,  This  is  similar  to  the  approach  of  PC-Style,  The 
second  fine-tuning  change  was  not  adding  the  length  of  the  FourDix  words  to  the  tally 
of  word  lengths.  With  the  FourDix  words,  a  bar  graph  of' word  length  usually  had  two 
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bumps :  one  tall  one  around  three  letters  and  another,  shorter  one  near  five  letters,  By 
eliminating  the  length  of  the  FourDix  words,  both  fiction  and  nonfiction  writing  always 
showed  a  bell-shaped  curve.  The  bell  of  fiction  writing  tends  to  be  a  tall  one  centered 
on  five  letters/word;  the  bell  of  nonfiction  ■writing  tends  to  be  a  shorter  one  centered  on 
six  letters/word.  All  this  means  is  that  fiction  writers  use  shorter  words  and  words  of 
more  uniform  length,  whereas  nonfiction  writers  use  longer  words  and  words  of  more 
variable  length,  As  a  stylistic  measure,  its  use  is  limited.  As  a  bar  graph,  it  is  a  lot  of 
flm. 

In  any  case,  the  profiles  obtained  allowed  me  to  examine  the  true  usefulness  of 
The  Stylist.  ThankMy,  The  Stylist  was  able  to  distinguish  between  the  novels  of 
Ernest  Hemingway  and  the  ramblings  of  computer  science  graduates.  The  profiles  of 
each  text  corresponded  closely  to  my  subjective  impressions.  (This  is  not  surprising 
since  the  values  of  the  dictionary  were  also  the  product  of  my  subjective  decisions.  In 
this  sense,  The  Stylist  is  an  expert  system  that  has  automated  my  own  rules  and  sense 
of  style.  The  facility  of  allowing  the  dictionary  to  change,  however,  allows  any  user  to 
adapt  The  Stylist  to  his  own  taste), 

I  now  could  finish  coding  Reporter.  Reporter  now  took  the  profile  and 
manipulated  its  values  to  create  an  Analysis.  This  manipulation  took  into  account  the 
genre  of  the  user's  input  text  and  involved  simple  weighting  factors  rather  than  any 
complex  statistical  methods.  Like  the  original  values  of  the  words  of  the  dictionary, 
these  weighting  factors  depended  upon  my  own  subjective  impressions,  but  they  also 
derived  from  the  results  of  the  test  data  set  runs.  If  a  technical  paper  had  four  times  as 
many  intangible  words  than  tangible  words,  it  read  like  so  much  mush.  So  I 
incorporated  a  ratio  of  4:1  as  the  limit  of  intangible: tangible  for  nonfiction.  I  tried  to 
be  conservative  in  these  weightings,  because  I  always  envisioned  The  Stylist  as  a 
descriptive  rather  than  a  prescriptive  tool.  The  Stylist  thus  rebukes  the  user  only  when 
the  qualities  of  his  text  fall  far  outside  the  bounds  of  the  norm. 

B.  FIELD  TESTS 

The  Stylist  was  finally  ready  for  field  testing.  This  final  phase  was  meant  to 
mimic  the  introduction  of  the  software  product  for  consumer  use.  The  field  testing 
took  place  in  two  parts.  In  the  first,  two  computer  science  graduates  were  briefed  on 
the  program  and  asked  to  use  it.  In  the  second,  22  papers  for  an  Administrative 
Science  composition  course  were  run  though  The  Stylist. 


30 


1 ,  Field  Test  One :  The  Ape  Test 

The  two  computer  science  graduate  students  evaluated  the  program  by 
unleashing  it  on  some  of  their  own  writing.  Their  overall  reaction  was  highly  positive, 
possibly  because  The  Stylist  seemed  to  approve  of  their  writing.  They  praised  the 
human  factors  and  the  graphics.  They  thought  that  the  breakdowns  of  the  values  and 
the  concordances  were  intriguing  and  offered  them  insights  into  their  own  writing. 
They  complained  that  the  report  didn't  explain  the  meanings  of  some  of  the  values. 
They  also  didn't  like  the  need  to  remove  embedded  commands  from  the  input  text.  I 
addressed  these  complaints  by  adding  some  explanations  to  the  report  and  by  changing 
Reader  so  that  it  didn't  signal  End_of_Sentence  when  it  encountered  an  embedded 
command. 

2.  Field  Test  Two  !  The  Writing  Class 

A  Naval  Postgraduate  School  Administrative  Science  composition  class 
provided  22  short  papers  for  testing.  These  students  are  almost  all  military  officers 
who  are  considered  by  their  services  as  top  performers.  As  such,  they  constitute  a 
somewhat  literate  and  professional  test  group.  Their  fairly  uniform  papers  were 
apparent  attempts  to  incorporate  some  of  the  lessons  of  good  business  communication. 
I  ran  their  papers  through  The  Stylist,  then  provided  the  reports  to  the  Professor,  who 
gave  them  to  the  students.  I  never  met  the  students;  neither  did  they  ever  use  The 
Stylist  themselves.  Of  the  22  students,  12  filled  out  a  questionnaire  detailing  their 
reactions  to  the  reports. 

Most  (10  out  of  12)  had  never  heard  of  a  style  checker  before.  Most  (11  out 
of  12)  understood  the  reports.  One  student  thought  The  Stylist  was  worthless;  11 
found  some  of  its  features  helpfUl.  Six  students  found  the  measures  of  word  length  and 
sentence  length  helpful.  Nine  found  the  measures  of  vigor,  etymology  and  difficulty 
helpful.  Six  liked  the  concordance;  four  didn't  think  it  was  useful.  Interestingly,  only 
five  thought  The  Stylist  accurately  reflected  the  qualities  of  their  writing;  three  were 
sure  it  didn't;  three  just  weren't  sure.  One  common  complaint  was  that  they  weren't 
sure  of  the  meanings  of  some  of  the  categories. 

Overall,  this  test  underscored  the  need  for  a  user's  manual.  Appendix  B  is  just 
such  a  manual.  If  this  manual  had  been  available  for  the  students  or  if  they  had  had 
more  experience  using  The  Stylist,  then  they  would  have  better  understood  the 
meaning  of  the  reports.  Despite  this  drawback,  it  was  obvious  from  the  comments  in 
the  questionnaire  that  the  majority  of  the  students  liked  The  Stylist  and  would  want  to 


UM  it  or  similar  programs  to  analyze  their  writing.  This  test,  therefore,  indicated  that 

The  Stylist  would  be  able  to  find  a  place  in  the  classroom  and  possibly  even  the 
market. 
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VI.  CONCLUSIONS 


Researching,  designing,  coding  and  testing  The  Stylist  was  an  excellent  academic 
exercise.  It  brought  home  many  of  the  lessons  of  software  engineering.  It  sharpened 
my  skills.  The  final  product.  The  Stylist,  seems  a  success  in  that  its  affective  modeling 
works.  As  such,  The  Stylist  should  serve  as  a  contribution  to  the  development  of  other 
style  checkers,  rather  than  a  stand-alone  style  checker  itself.  As  to  whether  writers, 
teachers  and  students  should  use  computers  to  analyze  writing,  I'm  reminded  of  the 
following  passage  from  Plato's  "Phaedrus": 

Socrates : 

At  the  Egyptian  city  of  Naucratis,  there  was  a  famous  old  god,  whose  name 
was  Theuth  ...  his  great  discovery  was  the  use  of  letters.  Now  is  those  days  the 
god  Thamus  was  the  king  of  the  whole  country  of  Egypt  ...  To  him  came 
Theuth  and  showed  his  inventions  .  .  .  when  they  came  to  letters,  This,  said 
Theuth,  will  make  the  Egyptians  wiser  and  give  them  better  memories;  it  is  a 
specific  both  for  the  memory  and  for  the  wit.  Thamus  replied:  O  most  ingenious 
Theuth,  the  parent  or  inventor  of  an  art  is  not  always  the  best  judge  of  the  utility 
or  inutility  of  his  own  inventions  to  the  users  of  them.  And  in  this  instance,  you 
who  are  the  father  of  letters,  from  the  paternal  love  of  your  own  children  have 
been  led  to  attribute  to  them  a  quality  which  they  cannot  have;  for  this 
discover}'  of  yours  will  create  forgetfulness  in  the  learners'  souls,  because  they 
will  not  use  their  memories;  they  will  trust  to  the  external  written  characters  and 
not  remember  of  themselves.  The  specific  which  you  have  discovered  is  not  an 
aid  to  memory,  but  to  reminiscence,  and  you  give  your  disciples  not  truth,  but 
only  the  semblance  of  truth;  they  will  be  hearers  of  many  things  and  will  have 
learned  nothing;  they  will  appear  to  be  omniscient  and  will  generally  know 
nothing;  they  will  be  tiresome  company,  having  tne  show  of  wisdom  without  the 
reality.  [Ref.  18] 

In  the  fourth  century  before  Christ,  wise  Athenians  were  debating  the  uses  and 
evils  of  literature  itself.  As  Plato  points  out  in  the  above  passage  toward  the  end  of  his 
dialogue,  literature,  the  act  of  writing  down  our  ideas,  can  have  its  pitfalls.  It  can 
weaken  our  memories.  (Let  anyone  who  has  never  forgotten  where  he  parked  his  car 
argue  this  point.)  It  can  also  lead  to  intellectual  cheating.  The  wisdom  of  a  man 
capable  of  piecing  together  an  article  with  the  aid  of  a  library  and  a  long  weekend  may 
indeed  be  less  than  the  wisdom  of  a  man  who  can  stand  before  a  learned  crowd  and 
speak  an  intelligent  discourse. 


Nevertheless,  civilization  has  embarked  on  a  course  inseparable  from  literature 
and  writing.  Wealth,  power  and  knowledge  has  followed  the  progress  of  the  written 
word.  Literature  has  made  us  strong,  but  to  the  extent  that  we  rely  upon  books 
Instead  of  our  minds,  we  have  grown  weak.  Now,  at  what  we  call  the  dawn  of  the 
information  age,  we  would  do  well  to  remember  the  reservations  of  Plato.  Electronic 
computation  offers  us  dazzling  abilities,  but  to  the  extent  we  rely  upon  it  instead  of  our 
minds,  we  will  grow  weak. 

Therefore,  any  style  checker  or  CACI  program  should  be  used  as  an  interesting 
tool  that  provides  a  fresh  perspective  on  writing.  These  programs  cannot  take  and 
should  not  be  put  in  the  place  of  thoughtfltl  readers,  editors,  teachers  and  friends. 


APPENDIX  A 

SUGGESTED  EXTENSIONS  TO  THE  STYLIST 


1.  ABBREVIATIONS 

Reador's  Readjntext  procedure  ignores  embedded  commands  such  as 
".embedded"  by  keeping  count  of  the  number  of  words  read  since  the  end  of  the  last 
sentence.  If  only  one  word  has  been  read  when  another  period  is  encountered, 
Readjntext  does  not  signal  end^of^sentence,  This  simple  feature  also  allows 
Readjntext  to  ignore  the  ellipsis  It  doesn't  always  allow  it  to  ignore 

abbreviations  such  as  "Mr.",  "Dr",  "etc.",  or  "e.g,".  If  these  abbreviations  appear  as 
the  first  word  of  this  sentence,  Readjntext  doesn't  signal  end_of.sentence,  If  the 
abbreviation  appears  somewhere  in  the  middle  of  the  sentence,  Readjntext  does  signal 
end_of_sentence.  An  example  is  the  following  sentence  :  "You  should  have  told  Prof, 
Wu  about  this  earlier."  Readjntext  would  signal  end_of_sentence  at  "Prof."  and 
"earlier.",  This  is  a  shortfall  of  the  Reader  module,  but  one  which  I  haven’t  corrected 
for  four  reasons:  first,  such  occurrences  are  relatively  rare;  second,  breaking  the 
sentences  into  two  always  favors  the  user;  third,  the  length  of  sentence  measure  is  not 
the  thrust  of  The  Stylist;  fourth,  the  solution  would  involve  unwanted  overhead  in 
computation. 

Writer's  Workbench  solves  this  problem  for  at  least  48  abbreviations. 
Presumably,  every  time  a  period  is  encountered,  Writer's  Workbench  checks  a 
dictionary  of  these  48  abbreviations,  If  the  word  preceding  the  period  matches  one  of 
these  abbreviations,  then  Writer's  Workbench  does  not  count  this  as  an  end  to  a 
sentence.  A  similar  solution  could  be  implemented  for  The  Stylist. 

2.  TRANSLATION  TO  TURBO  PASCAL 

The  design  of  The  Stylist  kept  in  mind  the  working  environment  of  a  personal 
computer  with  1.5  megabytes  of  main  storage.  A  good  follow-on  project  for  The 
Stylist  would  be  to  translate  the  code  from  Waterloo  Pascal  to  Turbo  Pascal,  Copies 
of  this  program  could  then  be  distributed  and  possibly  even  marketed. 

3.  IMPRO  VEMENTS  OF  THE  CONCORDANCE 

In  addition  to  the  main  CST  and  the  FourDix,  a  third  CST  could  be  added.  This 
new'  CST's  node's  key  values  would  be  characters  of  punctuation.  During  the 


execution  of  InorderJUoncordance,  the  procedure  that  creates  the  concordance,  the 
traversals  "lastward"  and  "nextward"  could  terminate  upon  encountering  a  character  of 
punctuation  in  this  third  CST,  This  would  mean  the  lines  of  the  concordance  would 
only  contain  phrases  extracted  from  single  sentences,  This  may  or  may  not  be  a 
desirable  feature,  i  myself  found  it  usefhl  to  read  entire  phrases,  even  when  they 
overlapped  into  preceding  or  succeeding  sentences. 

More  sophisticated  concordances  include  at  the  end  of  each  line  a  note  as  to 
where  this  line  can  be  found.  The  third  CST  outlined  above  could  contains  as  a  field  in 
its  tnain  node  just  such  a  note,  with  the  name  of  the  user's  input  text,  the  page  number 
and  the  line  number.  Such  an  extension  would  be  valuable  for  scholarly  research.  For 
the  short,  single  input  texts,  such  notes  are  not  necessary. 

”4.  PARTS  OF  SPEECH 

A  considerably  more  ambitious  extension  would  be  to  provide  The  Stylist  the 
facility  of  determining  the  parts  of  speech.  Other  programs  [Ref.  8)  [Ref.  9]  [Ref.  10] 
solving  ,  this  problem  use  thousands  of  rules.  The  disambiguation  of  parts  of  speech 
,  would  raise  The  Stylist  to  a  whole  new  order  of  complexity,  It  would  also  allow  more 
fine  analysis  of  the  characteristics  of  the  words,  For  example,  "like"  as  a  verb  has 
much  different  connotations  than  “like"  as  a  preposition.  This  facility  would  also 
allow  other  stylistic  measures  to  figure  into  the  profiles,  such  as,  verb-adjective  ratio 
and  percentage  of  modifiers, 

5.  PASSIVE  VOICE 

The  Researcher  module  could  be  extended  to  detect  the  use  of  passive  voice, 
ideally,  this  would  be  accv'mplished  in  the  context  of  the  extension  involving  parts  of 
speech,  so  that  active  verbs  would  also  be  detected.  A  simpler  solution  is  readily 
available,  however.  Researcher  could  raise  a  flag  every  time  it  receives  a  form  of  the 
verb,  “to  be":  that  is,  "is",  "are",  "am",  "was",  "were",  "being",  and  "been".  This  flag 
would  remain  raised  for  the  next  three  words.  If  any  of  these  three  words  ended  in 
"-ed",  then  Researcher  would  count  this  as  an  example  of  the  use  of  the  passive  voice. 
Again,  I  haven't  implemented  this  solution  because  it  is  not  central  to  the  concept  of 
affective  modeling  and  because  I  didn't  want  to  pay  for  the  extra  computation. 

6.  CORRECTION  OF  POOR  DICTION  AND  GRAMMAR 

Another  ambitious  extension  of  The  Stylist  would  be  to  provide  it  with  the 
facility  of  correcting  poor  diction  and  grammar.  Both  RightWritcr  and  DICTION  of 


Writer's  Workbench  contain  dictionaries  of  commonly  used  cliches  or  repetitive 
redundancies.  Some  examples  from  DICTION  are  :  “a  great  deal  of,  "in  regards  to", 
"make  adjustments  to"  and  so  on.  One  solution  for  The  Stylist  would  be  to  include  a 
new  field  in  the  dictionary.  Words  like  "deal",  "regards"  and  "adjustments"  would 
contain  in  this  field  a  pointer  to  a  table  containing  the  poor  phrases  and  their 
replacements.  Such  a  solution  would  also  require  Researcher  to  maintain  a  phrase  of 
the  three  or  four  most  recently  received  words,  so  that  it  could  verify  that  "regards" 
appeared  in  the  offending  context  of  "in  regards  to". 

Grammatical  corrections  would  require  an  even  more  ambitious  extension.  To 
perform  properly,  such  a  facility  would  have  to  be  an  extension  of  the  parts  of  speech 
extension.  Determining  agreement  between  subject  and  verb,  for  example,  would 
require  first  the  identification  of  the  subject  and  the  verb.  Frankly,  it  would  be  easier 
to  incorporate  the  affective  modeling  of  The  Stylist  into  a  more  extensive  program  such 
as  Writer's  Workbench  than  it  would  be  to  extend  The  Stylist  this  far. 


APPENDIX  B 
USER'S  MANUAL 


1.  INTRODUCTION 

The  Stylist  is  a  Waterloo  Pascal  program  that  analyzes  the  style  of  English  prose, 
both  fiction  and  nonfiction.  The  Stylist  package  contains  the  following  :  this  User's 
manual,  and  the  following  electronic  files  :  the  Waterloo  Pascal  code  of  The  Stylist 
itself,  the  text  file  dictionary  of  The  Stylist,  and  two  Waterloo  Pascal  programs  for 
helping  to  maintain  the  dictionary,  DIXSPLIT  and  DIXJOIN, 

The  ftindamental  idea  of  The  Stylist  is  that  individual  words  have  power.  The 
denotation  of  a  word  is  its  meaning.  The  connotation  of  a  word  is  its  emotional 
impact.  For  example,  "pupil",  "student", and  "scholar"  all  denote  a  person  who  studies 
and  learns.  The  emotional  connotations,  however,  range  from  the  humble  "pupil" 
through  the  familiar  "student"  to  the  lofty  "scholar".  Besides  these  emotional 
connotations,  words  have  other  intrinsic  values  which  can  be  quantified.  The  Stylist 
concentrates  on  the  following  values:  Etymology,  Tangibility,  Difficulty,  Emotional 
Connotation  and  Vigor, 
a.  Etymology 

One  of  the  beauties  of  the  English  language  is  that  its  vocabulary  embraces 
two  main  sources  :  native  and  borrowed.  Our  native  words  mainly  come  to  us  from 
the  Anglo-Saxon  tongues.  Borrowed  words  come  from  Latin,  mostly  by  way  of  the 
French  of  the  Norman  conquest.  See  Table  7,  which  illustrates  the  difference. 


TABLE  7 

EXAMPLES  OF  ETYMOLOGY 


Native  Words  Borrowed  Words 


Man 

Finger 

Thinker 

Fire 

Book 

Dirtbag 


Person 

Digit 

Philosopher 

Conflagration 

Volume 

Miscreant 


Native  words  tend  to  be  short,  strong  and  rough.  Borrowed  words  tend  to  be 
long,  mild  and  elegant.  Good  English  prose,  particularly  good  fiction,  tends  to  use 
native  words.  Poor  English  prose,  particularly  bad  technical  writing,  exhibits  a 
tendency  to  overutili2e  latinate  etymology.  By  counting  up  the  number  of  native  and 
borrowed  words  in  your  input  text,  The  Stylist  sees  how  it  compares  to  good  writing  of 
the  appropriate  genre, 
b,  Tangibility 

A  word  can  either  evoke  an  image  of  a  thing  or  it  can  refer  to  an  idea.  The 
former  words  are  "tangible"  and  the  latter  are  "intangible”.  Tabic  8  illustrates  the 
difference. 


TABLE  8 

EXAMPLES  OF  TANGIBILTY 

TANGIBLE 

INTANGIBLE 

Rock 

Lips. 

Beehive 

Corvette 

Hammer 

Ethereal 

Automatically 

Then 

Preliminary 

Tangible  words  are  concrete,  exact  and  evocative.  Intangible  words  are  ideal, 
general  and  cognitive,  All  writing  uses  both.  Good  writing,  however,  usually  takes 
advantage  of  tangible  words.  Even  the  most  philosophical  writing  benefits  flora  the 
use  of  tangible  words.  Good  fiction  rarely  strays  too  far  toward  the  intangible.  The 
Stylist  counts  the  tangible  and  intangible  words  of  your  input  text  and  compare  this 
count  to  good  writing  of  the  appropriate  genre, 
c.  Difficulty 

This  is  one  of  the  most  indisputable  characteristics  of  words,  The  Stylist 
defines  four  levels  of  difficulty  ;  Elementary,  High  Schdoi,  Graduate  and  Postgraduate, 
The  words  of  each  level  are  those  most  likely  to  be  used  in  speech  with  ease  by  an 
average  person  of  that  educational  level,  Table  9  illustrates  the  categories. 

By  counting  the  occurrences  of  difficult  words,  The  Stylist  could  determine  the 
overall  difficulty  of  the  vocabulary  and  thus  the  readibiiity  of  your  text. 
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TABLE  9 

EXAMPLES  OF  DIFFICULTY 

ELEMENTARY 

HIGH  SCHOOL 

GRADUATE 

POSTGRADUATE 

Big 

Sister 

Spoon 

Flying 

Wish 

Done 

Handsome 

Tardy 

Rendezvous 

Transmission 

Foreman 

Process 

Joyous 

Undergo 

Matrix 

Immaterial 

Tonality 

Universal 

Induction 

Processor 

Linear 

Execrable 

Parsimony 

Recursive 

Homomorph 

Dilatory 

Zygote 

Synergy 

d.  Emotional  Connotations 

The  Stylist  defines  five  categories  of  emotional  connotation  :  Sublime, 
Pleasant,  Neutral,  Unpleasant  and  Horrid.  Table  10  illustrates  the  categories. 


TABLE  10 

EXAMPLES  OF  EMOTIONAL  CONNOTATION 

SUBLIME 

PLEASANT 

NEUTRAL 

UNPLEASANT 

HORRID 

Beauty 

Sunrise 

Victory 

Love 

God 

Paradise 

Happy 

Food 

Friendly 

Warm 

Helpful 

Sex 

The 

Which 

Brick 

Is 

Name 

When 

Damage 

Loss 

Insulting 

Loser 

Cost 

Wound 

Cancer 

Murder 

Whore 

Fucks 

Death 

Traitor 

By  counting  the  use  of  these  types  of  words,  The  Stylist  could  determine  the  overall 
emotional  tone  of  your  passage.  It  could  also  detect  flat  or  highly  emotional  writing, 
e.  Vigor 

A  related  but  distinct  characteristic  is  vigor,  Sublime  and  horrid  words  tend  to 
be  highly  vigorous,  but  not  all  highly  vigorous  words  are  emotional.  Examples  of 
vigorous  but  unemotional  words  are  "sprint",  "rush"  and  "cross".  This  category  also 
tends  to  be  more  objective.  For  example,  the  word,  "soldier"  can  have  widely  different 
connotations  for  different  people.  The  word  "soldier"  would  please  a  career  Army 
officer  but  it  would  displease  a  survivor  of  the  Japanese  occupation  of  Canton.  Both 
could  agree  that  "solider"  is  a  vigorous  word. 


TABLE  11 

EXAMPLES  OF  VIGOR 

VIOLENT 

ENERGETIC 

CALM 

INERT 

Destroy 

Creation 

Fire 

Atomic 

Holocaust 

Conqueror 

Sprint 

Dive 

Discipline 

Wedding 

Steam 

Flying 

Read 

Write 

Manager 

Ocean 

Blue 

Sought 

From 

Into 

Something 

Comma 

Paper 

Format 

By  calculating  the  vigor  the  words  of  your  input  text,  The  Stylist  would  be 
able  to  estimate  its  overall  strength, 


2.  USING  THE  STYLIST 

The  first  step  to  using  the  Stylist  is  to  save  your  piece  of  writing  as  "Input  text 
a".  You  should  be  aware  of  the  following  principles: 

1.  Your  input  text  should  be  at  least  500  words  long,  The  Stylist  will  examine 
shorter  texts,  but  the  statistical  sample  of  shorter  texts  is  too  small  for  valid 
analysis. 

2.  Your  input  text  should  not  be  longer  than  500  sentences  (about  10,000  words), 
Such  long  texts  require  so  much  computation  that  most  operating  systems  such 
as  MVS  will  terminate  execution  before  completion, 

3.  The  Stylist  is  not  meant  to  analyze  non-prose  constructions  such  as  tables,  lists, 
references  and  bibliographies.  You  should  eliminate  these  from  your  input  text. 

In  the  multiprocessing  environment,  DEFINE  MAIN  STORAGE  at  1500k.  The 
VM/CMS  command  is  "Define  storage  1500k"  followed  by  "I  CMS". 

Once  your  input  text  is  properly  stored,  execute  The  Stylist.  DO  NOT  interrupt 
The  Stylist  during  compilation  or  execution  by  hitting  any  keys.  Always  wait  for  The 
Stylist  to  display  a  screen  with  a  prompt.  After  The  Stylist  is  compiled,  you  will  see 
just  such  a  series  of  screens  that  will  ask  you  for  information.  The  following 
discussions  will  help  you  decide  how  to  answer. 

a.  Name  of  Intext 

The  first  screen  will  ask  you  for  the  name  of  your  input  text.  Simply  type  in 
the  name.  You  can  use  any  characters,  The  maximum  length  of  the  name  is  40 
characters.  Hit  "enter". 
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b.  Fiction  or  NonFiction 

The  screen  will  automatically  clear.  A  new  screen  will  ask  you  if  your  input 
text  is  fiction  or  nonfiction.  Enter  "1“  or  "2".  If  you  enter  any  other  character,  The 
Stylist  will  merely  ask  you  again, 

c.  Report  Frequency 

The  next  screen  asks  whether  you  want  reports  "frequently",  "seldomly"  or 
"when  done".  A  report  is  a  screen  which  The  Stylist  displays  to  you  during  execution, 
telling  you  what  it's  ioing,  how  much  it's  done  and  how  much  remains  to  be  done. 
The  act  itself  of  producing  such  takes  time  ;  the  more  frequent  the  reports,  the  longer  it 
takes.  You  should  probably  enter  "2"  for  "Seldomly".  Most  users  find  this  the  most 
agreeable. 

d.  Concordance 

The  next  screen  asks  whether  or  not  you  want  a  concordance.  A  concordance 
is  an  alphabetical  listing  of  all  the  significant  words  of  your  input  text.  ("Significant" 
words  are  all  words  other  than  functional  words  such  as  "the",  "and",  "as",  and  so  on.) 
Each  line  of  the  concordance  contains  the  key  word  and  the  phrases  preceding  and 
succeeding  it.  The  following  is  an  example  of  a  concordance. 

them  and  thamus  enquired  ABOUT  their  several  uses  and 

discovered  is  not  an  AID  to  memory  but  to 

other  egyptians  might  be  ALLOWED  to  have  the  benefit 

an  art  is  not  ALWAYS  the  best  judge  of 

is  called  by  them  AMMON  to  him  came  theuth 

learned  nothing  they  will  APPEAR  to  be  omniscient  and 

censured  others  as  he  APPROVED  or  disapproved  of  them 

many  arts  such  as  ARITHMETIC  and  calculation  and  geometry 

or  inventor  of  an  ART  is  not  always  the 

the  inventor  of  many  ARTS  such  as  arithmetic  and 

blame  of  the  various  ARTS  but  when  they  came 

calculation  and  geometry  and  ASTRONOMY  and  draughts  and  dice 

have  been  led  to  ATTRIBUTE  to  them  a  quality 

in  the  learners’  souls  BECAUSE  they  will  not  use 

your  own  children  have  BEEN  led  to  attribute  to 


.  Most  users  find  that  a  concordance  offers  them  interesting  insights  into  their 
use  of  words.  The  only  drawback  of  producing  a  concordance  is  that  it  takes  The 
Stylist  additional  time.  If  you  elect  this  option,  your  concordance  will  be  printed  to  a 
separate  file  called,  "Concrdnc  text  a". 

e.  Number  of  Words  in  Intext 

The  next  screen  asks  you  for  the  number  of  words  in  your  text.  If  you  don't 
know,  make  any  guess,  This  number  is  only  used  in  the  screens  displaying  how  far 
along  The  Stylist  is  in  reading  the  text,  Your  guess  will  have  no  effect  on  the  analysis 
of  the  text,  The  Stylist  counts  the  words  itself  and  will  report  the  exact  number  to  you 
when  it  finishes, 

f.  Expand  the  Dictionary 

The  next  screen  will  ask  whether  you  want  to  expand  the  dictionary  of  The 
Stylist  or  not.  This  dictionary  is  a  listing  of  about  3200  of  the  most  commonly  used 
words,  together  with  five  values  for  each  word.  If  you  answer  "1"  for  "Yes",  the  Stylist 
will  ask  you  to  give  it  a  values  for  every  word  in  you  input  text  which  is  not  found  in 
the  dictionary.  It  will  then  add  these  new  words  and  their  values  to  the  dictionary,  If 
you  answer  "2"  for  "No",  then  The  Stylist  will  write  all  of  the  unknown  words  to  a  file 
called  "Newwords  text  a".  Reviewing  "Newwords"  will  give  you  an  idea  of  the  words 
that  you  use  which  aren't  in  the  dictionary. 

The  first  few  times  that  you  use  The  Stylist,  you  probably  should  answer  "2" 
for  "No".  Entering  the  values  for  the  new  words  can  be  tedious.  You  probably  want 
to  see  how  The  Stylist  works  before  you  begin  to  expand  or  modify  the  dictionary, 
After  you've  used  The  Stylist  for  a  while,  you'll  probably  want  to  begin  adding  some 
words  to  the  dictionary,  Words  peculiar  to  your  field  of  writing  or  the  jargon  of  your 
profession  are  examples  of  the  types  of  words  which  should  be  entered,  Adding  them 
will  tailor  The  Stylist  to  your  working  environment.  Keep  the  following  in  mind  ; 

***  Every  word  added  to  the  dictionary  is  another  word 
that  will  have  to  be  loaded  into  memory  during  each 
execution  of  The  Stylist.  Adding  many  words  will  slow 
down  execution  and  it  will  decrease  the  amount  of  space 
in  main  storage.  If  the  dictionary  grows  beyond  5000 
words,  you  should  weed  out  words  that  are  rarely  used 
in  your  type  of  writing,  To  do  this,  simply  delete 
that  line  from  the  file  "Dixonary  text  a".  *** 
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If  you  answer  "1*  for  "Yes",  as  the  Stylist  encounters  each  new  word  in  your 
text,  it  will  ask  if  you  want  to  add  this  particular  word  to  the  dictionary.  If  the  word  is 
a  misspelling  or  if  it  is  a  word  that  you  rarely  use,  don't  add  it.  Also,  you  should  know 
that  its  best  to  add  the  root  form  of  words.  For  example,  neither  "burns”  nor  “burned" 
nor  "burning"  should  be  added.  It's  much  better  to  add  "burn".  To  keep  the 
dictionary  as  compact  as  possible,  note  down  the  root  form  of  words  you  wish  to  add 
to  the  dictionary  and  append  them  to  the  end  of  the  next  text  you  run  through  The 
Stylist. 

The  Stylist  will  then  ask  you  for  the  values  of  the  words  you  do  want  to  add 
to  the  dictionary,  The  following  will  help  you  decide  how  to  answer  these  prompts : 

/.  Source 

If  you're  uncertain  about  the  etymology  of  the  word,  simply  look  it  up  in  a 
collegiate  dictionary,  Old  English,  Middle  English,  Norse,  German,  Old  German, 
Middle  German  and  all  Celtic  languages  are  considered  "Germanic".  Latin,  French, 
Spanish,  and  all  other  languages  are  considered  "Latin". 

2.  Difficulty 

Each  category  contains  those  words  which  are  most  likely  to  be  used  in 
speech  by  persons  of  that  educational  level.  If  you're  still  uncertain  of  the  meaning  of 
these  categories,  the  following  are  examples  of  the  categories : 

POSTGRADUATE  DIFFICULTY 

aggregate 

algorithm 

algorithmic 

ambiguity 

analogue 

applicative 

conceptualize 

conglomerate 

constriction 

consummate 

consummation 

convoluted 

convolution 


culminate 

cyclical 

digitize 


GRADUATE  DIFFICULTY 

accumulate 

acrid 

acute 

adhere 

adjoin 

allocate 

allude 

analogy 

analysis 

analyst 

ancestor 

ancestral 

annihilate 

apparatus 

appendage 

arc 

arch 

array 

HIGH  SCHOOL  DIFFICULTY 

abandon 

ability 

able 

abrupt 

absence 

absent 

absorb 

acceptable 

access 


accommodate 

accomplish 

accomplishment 

account 

accurate 

achievement 

activate 

adapt 

additional 

adjust 

advantage 

ELEMENTARY  DIFFICULTY 

about 

above 

accept 

acceptance 

ache 

across 

act 

action 

active 

activity 

actor 

add 

addition 

address 

admit 

advance 

adventure 

afraid 

after 

afternoon 

again 
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3.  Conor ot mess 

Does  this  word  name  a  thing  that  you  can  touch?  If  it  does,  then  answer 
"I"  for  "Tangible",  Otherwise,  answer  "2". 

4.  Emotional  Connotation 

What  feelings  does  this  word  evoke  in  you?  If  none,  answer  "3"  for  neutral. 
Otherwise,  answer  in  the  appropriate  category, 

5.  Vigor 

Vigorous  words  are  not  necessarily  verbs.  Words  like  "conquest”  are  highly 
vigorous,  "Violent"  vigor  means  "highly"  vigorous.  Some  of  the  words  in  the  "violent" 
category'  are  "creation",  "triumph"  and  "epiphany".  This  category,  as  does  emotional 
connotation,  calls  for  your  subjective  reaction,  Don't  be  afraid  to  "guess"  which  value 
is  appropriate.  Such  a  "guess"  is  in  fact  your  subjective  reaction, 
g.  The  End  of  Execution 

The  above  are  all  the  screens  which  prompt  you  for  answers.  All  the  rest  of 
the  screens  will  merely  inform  you  about  the  progress  of  the  execution,  When 
execution  ends,  you  should  look  into  the  following  files  :  "Newwords  text  a",  if  you 
decided  not  to  let  the  dictionary  expand;  "Concrdnc  text  a",  if  you  elected  to  have  a 
concordance  produced;  and  most  importantly,  "Report  text  a",  which  is  The  Stylist's 
report  on  your  text. 

3.  THE  MEANING  OF  YOUR  REPORT 
a.  Measures  of  Length 

The  first  two  pages  of  your  report  deals  with  the  number  of  words,  the  length 
of  words  and  the  length  of  sentences,  These  measures  are  always  interesting  but  not 
always  illuminating.  The  first  bar  graph  shows  the  distribution  of  the  length  of  the 
words  of  your  text.  Each  horizontal  column  represents  the  percentage  of  the  words  of 
your  next  which  were  of  so  many  letters  of  length.  The  overall  bar  graph  should  look 
like  u  bell-shaped  curve.  This  bar  graph  does  not  include  function  words  such  as  "a", 
"an",  "the",  "then"  and  "or".  The  blurb  below  the  bar  graph  explains  how  your  text 
compares  to  other  texts  of  the  same  genre.  The  only  clear  indication  of  prose  in 
trouble  would  be  a  dramatic  slewing  of  the  curve  to  the  right.  This  would  mean  that 
you're  using  too  many  windy  words,  Table  12  breaks  down  the  characterizations  of 
v  ord  length, 


TABLE  12 
WORD  LENGTH 


The  second  bar  graph  shows  the  length  of  the  sentences  of  your  input  text. 
Each  bar  represents  the  length  of  a  sentence.  You  can  use  this  graph  to  identify  run- 
on  sentences.  Overall,  this  bar  graph  should  resemble  the  skyline  of  a  modest  city  such 
as  San  Francisco.  If  it  resembles  New  York,  then  you  tend  to  write  long  sentences. 
Table  13  explains  the  characterizations  of  sentence  length. 


TABLE  13 

SENTENCE  LENGTH 

Characterization 

Lower  boundary  of  length 

Too  Long 

n 

Long 

Medium 

18 

15 

Short 

10 

Too  Short 

0 

One  of  the  measures  below  this  bar  graph  is  "Number  of  sweet  spots".  This  is 
the  number  of  sentences  ranging  between  9  and  19  words.  The  idea  here  is  that  most 
ideas  can  find  a  home  in  a  sentence  of  this  length.  Table  14  breaks  down  how  The 
Stylist  characterizes  the  modulation  of  your  sentences. 

Another  count  here  is  the  number  of  run-on  sentences.  The  Stylist  defines 
any  sentence  over  45  words  in  length  as  a  run-on.  The  Stylist  realizes  that  many  grand 
sentences  over  45  words  in  length  are  not  run-ons,  but  it's  willing  to  bet  that  most 
overlong  sentences  are. 


TABLE  14 
MODULATION 


Percentage  Characterization 


Good 

Average 

Poor 


b.  Measures  of  Word  Characteristics 
/.  Etymology 

The  next  item  in  the  report  is  the  number  of  Latinate  and  Germanic  words. 
Notice  that  these  two  numbers  will  not  add  up  to  the  total  number  of  words  in  the  text 
unless  every  word  in  the  text  happened  to  belong  to  the  dictionary.  The 
characterization  of  your  text  in  this  category  depends  on  the  ratio  of  latin  to  germanic 
words.  Good  writing  generally  tends  to  be  native.  Table  15  breaks  down  these 
characterizations. 


TABLF;  15 

CHARACTERIZATIONS  OF  ETYMOLOGY 

Characterization 

Ratio  of  Latin  to  German 
Fiction  NonFiction 

Vary  Borrowed 
Borrowed 

Mixed 

Native 

Very  Native 

2:1  3:1 

1.5:1  2:1 

1:1  1:1 

.5:1  .9:1 

<.5:1  <.9:1 

2.  Difficulty 

The  next  entry  shows  the  tally  of  the  numbers  of  words  of  the  various 
levels  of  difficulty,  followed  by  the  percentage  of  those  levels.  The  difficulty  of  the 
text's  vocabulary  should  be  appropriate  for  its  intended  audience.  Moreover,  you 
should  remember  than  many  outstanding  works  of  literature  convey  difficult  ideas 
without  resorting  to  difficult  words,  Table  16  describes  the  characterizations  of  this 
category, 
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TABLE  16 

CHARACTERIZATIONS  OF  DIFFICULTY 


Characterization  Causa 


Very  Hard 
Hard 

Challenging 

Easy 


%postgrad  >  0 
Sgrad  >  5 

/Jnighachool  >  10 
above  percentages 
do  not  pertain 


3,  Tangibility 

The  next  entries  show  the  total  of  the  tangible  and  intangible  words.  As  In 
the  etymology  category,  the  characterizations  depend  upon  the  ratio  between  these 
words  and  the  genre.  Generally,  good  writing  is  tangible. 


TABLE  17 

CHARACTERIZATIONS  OF  TANGIBILITY 


Characterization 

?Efc£fi,u,u 

Very  intangible 


Ratio 


of  Intangible  to  Tangible 
Fiction  Nonfiction 


>1.  5i  1 
1.  5s  1 
3:  1 


>2;  1 
2:  1 
4:  1 


4.  Emotional  Connotation 

The  next  breakdown  is  of  the  number  and  percentages  of  the  words 
belonging  to  the  various  categories  of  emotional  connotation.  Following  these  values 
is  something  called  the  "Index  of  Emotion".  The  following  is  the  formula  used  to 
derive  this  index : 

Index  ■  (Percent  Horrid  times  5)  + 

(Percent  Unpleasant  times  2)  + 

(Percent  Pleasant  times  2)  + 

(Percent  Sublime  times  5) 
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The  Index  of  Emotion  is  simply  a  number  that  conveys  a  sense  of  the  use 
of  strongly  emotional  words.  The  most  emotional  words,  Horrid  and  Sublime,  are 
weighted  the  most  heavily.  This  Index  allows  The  Stylist  to  compare  your  use  of 
emotional  words  to  the  use  found  in  other  texts.  Table  18  shows  the  thresholds  of 
such  uses. 


TABLE  18 

CHARACTERIZATIONS  OF  EMOTIONALITY 

Characterization 

Lower  Boundry  of  Index 

Rich 

Average 

13 

Poor 

0 

Following  this  characterization  is  another  one,  this  time  for  the  overall  tone 
of  your  text.  The  Stylist  compares  the  frequency  of  use  of  positive  words  to  the 
frequency  of  use  of  negative  words.  If  positive  words  prevail,  the  tone  is  positive.  If 
negative  words  prevail,  the  tone  is  negative.  If  the  two  balance  and  the  index  of 
emotion  is  high,  then  the  tone  is  characterized  as  a  balance  of  strong  positive  and 
strong  negative  emotions,  If  the  two  balance  and  the  index  of  emotion  Is  low,  then  the 
tone  is  characterized  as  bland. 

5.  Vigor 

The  final  breakdown  is  that  of  the  number  and  percentage  of  words  of  the 
various  categories  of  vigor.  Following  this  is  an  "Index  of  Strength",  which  is  a 
measure  similar  to  the  Index  of  Emotion.  The  following  is  the  formula  used  to  derive 
the  Index  of  Strength  : 

Index  of  Strength  -  (Percent  of  violent  words  times  10)  + 

(Percent  of  energetic  words  times  5)  +• 

(Percent  of  calm  words) 

This  Index  allows  the  Stylist  to  compare  your  use  of  vigorous  words  to  uses 
found  in  other  texts.  Table  19  explains  how  The  Stylist  uses  the  index  to  characterize 
your  text.  Most  good  writing  is  strong  or  lively.  Usually  only  action-packed  fiction 
reaches  the  upper  registers  of  "very  strong". 
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TABLE  19 

CHARACTERIZATIONS  OF  VIGOR 


Characterization 

Vary  Strong 

Strong 

Lively 

Weak 


Lower  Boundry  of  Index 

60 

50 

,20 

0 


6,  Recommendations 

The  final  section  of  your  report  includes  recommendations  and 
commendations.  The  Stylist  is  meant  to  be  a  descriptive  rather  than  a  prescriptive 
tool.  Therefore,  it  makes  recommendations  only  when  some  aspect  of  your  writing 
seems  well  outside  of  the  bounds  of  the  normal.  For  this  reason,  you  should  consider 
the  recommendations  of  The  Stylist.  They  apply  only  to  what  appear  to  be  extreme 
cases.  On  the  other  hand,  you  should  never  accept  these  recommendations  as  the 
pronouncements  of  some  oracle.  The  Stylist  is  merely  a  machine.  When  in  doubt, 
trust  your  own  instincts. 

4.  MAINTENANCE  OF  THE  STYLIST 

a.  Care  of  the  Dictionary 

Many  of  the  issues  regarding  the  care  of  the  dictionary  have  already  been 
covered.  Here  are  some  other  helpful  pointers : 

1.  Always  maintain  more  than  one  copy  of  the  dictionary.  Loss  of  the  dictionary’ 
renders  The  Stylist  useless. 

2.  Beware  of  changing  the  values  directly.  You  should  never  do  this  to  the 
dictionary  itself.  While  using  DIXSPLIT,  it  is  necessary  to  go  into  the  split 
files  and  change  the  codes  for  the  words  which  don't  belong  in  that  split  file.  If 
you  make  a  mistake,  DIXJOIN  will  merely  assign  the  most  common  value  for 
that  category. 

b.  Changing  the  Code  of  The  Stylist 

The  Stylist  comes  with  embedded  documentation.  Make  sure  that  you  read 
this  documentation  prior  to  changing  any  of  its  code.  You  should  read  the  full 
master's  thesis  pertaining  to  The  Stylist  prior  to  making  any  substantive  changes,5 

5Sce  Master's  Thesis :  The  Stylist  :  A  Pascal  Program  for  Analyzing  Prose  Style, 
by  Lt.  Thomas  C.  Cool,  USN,  Naval  Postgraduate  School,  Monterey,  California,  1987, 


• 

TABLE  20 

CODES  OF  THE  DICTIONARY 

First 

Etymology 

1( atin) 
g( erman) 

Second 

Tangibility 

t( angible) 
,i{  ntang,  ) 

COLUMN 

Third 

Difficulty 

e(  lement.  ) 
h  igh  stc.  ) 

pj  oat>gr. ) 

Fourth 

Emotion 

h(  orrid 
u(  nple. 
m,  eutr. 

si  ibf!:  ■ 

Fifth 

Vigor 

i(nert) 
c  aim) 

«  rierg, 
v(  iole.  ) 

Substantive  changes  should  also  bear  in  mind  the  modularization  of  the 
design.  Pleader  module  bears  foil  responsibility  for  reading  the  user's  input  text.  It 
passes  along  words  (in  lower  case)  and  their  length.  It  also  signals  the  ends  of 
sentences  and  the  end  of  the  hie.  Researcher  Module  bears  foil  responsibility  for 
maintaining  and  using  the  dictionary,  for  building  up  the  profile  of  the  user's  input 
text,  and  for  building  the  concordance.  Finally,  Reporter  module  accepts  the  profile, 
creates  the  analysis  and  writes  out  the  report, 

Some  fine  tuning  of  The  Stylist  Is  readily  available.  The  global  constants  give 
you  the  capability  to  adapt  The  Stylist  to  your  operating  environment.  For  example, 
"Maxsent"  allows  you  to  determine  the  maximum  number  of  sentences  in  the  input 
text.  Moreover,  by  manipulating  the  weighting  factors  in  the  Calculate  procedures  of 
the  Reporter  module,  you  can  adapt  The  Stylist  to  the  norms  of  a  particular  school  of 
writing,  such  as  submissions  to  magazines  as  varied  as  "Fantasy  and  Science  Fiction" 
and  "Foreign  Affairs." 
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AP'  «  SD1X  C 

THE  CODE  OF  THE  STYLIST 


(X  NOTE  .  THE  FOLLOWING  IS  NOT  IN  EXECUTABLE  ORDER. 

IT  IS  IN  THE  ORDER  OF  GREATEST  CONCEPTUAL  CLARITY.  X) 


(X*S60000X) 

( XKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKXXXXXXXXXKXXXXXXXXXXXXXKXXXXX) 


(X  X) 
<X  THE  STYLIST  x> 
(X  X) 
(X  ■  Pascal  Program  for  Analyzing  Proaa  Styla  X) 
(X  by  Lieutenant  Thomas  C.  Cool*  USN  X) 
(X  Submittad  in  partial  fulfillmant  of  tha  raquiramania  X) 
(x  for  the  degree  of  x) 
IX  MASTER  OF  SCIENCE  IN  COMPUTER  SCIENCE  X) 
(x  from  tha  X) 
(X  NAVAL  POSTGRADUATE  SCHOOL  X) 
(X  June  1987  X) 
(X  X) 


(XXXXXXXXXKXXXXXXXKXXXXXXXXXXXXXXXXXXKXXXXXXXXXXXXXXXKXXXXXXXKXXXXXXXX) 

program  Stylist  (input,  output); 

const 

maxsent  3  500;  (x  limit  of  number  of  sentences  in  intaxt  X) 

wordlength  *  15; 
linalength  3  40; 

phraselength  3  4;  (X  length  of  phrase  around  condcordance  linaX) 


freqnum  3  50;  (X  tha  three  numbers  determine  how  often  X) 
salnum  »  200;  (X  Stylist  reports  to  user  during  axacutiomX) 
whandnnenum  3  1000000;  (X  frequently,  seldomly  or  whan  done  x) 


type 

(X  GLOBAL  TYPES  X) 


Genretype  3  (Nonfiction,  Fiction); 

Frequeneytype  3  (Frequently,  Stldomly,  When_Done); 

Wordtype  3  packed  array  (. 1 . .wordlength. )  of  char; 

Linotype  3  packed  array  (. 1 .. linalength. )  of  char; 

User^Infotype  3  record 

Nam*_of„text  i  linetype; 

Genre  i  Genretype; 

Size  t  Integer;  (x  users  guess  of  intext  length  x) 

Frequency  i  Frequeneytype;  (x  how  often  user  wants  reports  x) 

Want_Concord  t  boolean; 

Want_Dix_Grow  t  boolaan; 
end; 


(X  READER  MODULE  TYPES  X) 

Readtype  3  record  (X  created  by  Reader,  passed  to  Researcher  x) 
Word  i  wordtype; 

Length  ■  integer; 
end; 


(X 


RESEARCHER  MODULE  TYPES 


X) 


Balancetypa  *  (Plus,  Zero,  Minus);  (x  Balanca  of  Nods  in  BST  X) 


Statustype  8  (Storable,  Not_Storable,  NotJ/aluabla) ; 

(X  Storabla  -  a  full  noda  wi?h  all  valuas  which  X) 
•  (X  can  ba  addad  to  tha  dictionary  X) 

(X  Not_Storabla  -  a  noda  of  a  variant  (-ing,  -ad)  x) 

(X  word  which  has  values  but  should  X) 

(X  not  ba  addad  to  dictionary  x) 

(X  Not_Valuabla  -  a  noda  of  a  naw  word  without  x) 

(x  valuas.  Cannot  ba  Add  Valued  and  X) 

(x  cannot  ba  addad  to  dictionary.  x) 

Sourcatypa  -  (Latinate,  Germanic); 

Difficuitytypa  3  (PostQrad,  Grad,  High  School,  Elamantary); 


Concratanasstypa  3  (Tangible,  Intangible); 

Emotiontype  -  (Sublime,  Pleasant,  Neutral,  Unpleasant,  Horrid); 
Vigortype  3  (Violent,  Energetic,  Calm,  Inert); 

(x  tags  which  describe  each  word  in  tha  dictionary  X) 

EntryPointar  ■  -Entrytype; 

ConcordPointar  3  -Concordtype; 

Entrytype  3  record  (X  main  noda  in  tha  BST  X) 

Word  i  Wordtype;  (x  word  is  key  field  x) 

Balanca  t  Balancetype; 

Status  t  Statustype; 

Source  i  Sourcatypa; 

Difficulty  i  Difficuitytypa; 

Concreteness  >  Concratanasstypa; 

Emotion  '  Emotiontypa; 

Vigor  i  Vigortype; 

Left,  Right  i  EntryPointar; 

Down  i  Concordpointer; 
and; 

Concordtypa  ■  record  (x  a  Concord  noda  is  a  cluster  of  pointers  x) 

Up  i  EntryPointar;  (X  which  create  linked  lists  from  tha  firstx) 

Down,  (X  word  of  tha  Intext  to  the  last,  and  fromX) 

Next,  (X  the  last  to  tha  first,  Used  only  to  x) 

Last  t  ConcordPointar;  (X  create  tha  concordance.  x) 

end; 
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LtraPerWordtype  ■  array  ( ,  0 . , wordlangth. )  of  integer; 
(X  array  of  total  number  of  worda  of  aach  length  X) 

WordaPerSenttypa  ■  array  (.0. .maxaent.)  of  intagar; 

(x  array  of  tne  langth  of  aaoh  aantanoa  X) 


Profilatypa  ■  racord 


Totalworde  t  integer; 
Totallettars  >  integer; 
Totalaantaneaa  i  integer; 
TotalStrucltrs  t  intagar; 


(X  tha  profila  ia  tha  main  produot  x) 
(x  of  tha  Rasaarohar  Modula.  Hara  X) 
(x  Raaaarehar  kaaps  track  of  all  X) 
(X  count*.  X) 

(X  Total  numbar  of  worda  in  intaxt  X) 


(X  Total  of  ltra  of  atruotural  wrda  X) 


NumWordaThiaSant  i  intagar;  (X  •  worda  in  aantanoa  baing  raad 


(X  raat  of  profila  ara  count*  of  X) 
(x  worda  of  tha  intaxt,  found  in  X) 
(X  the, dictionaries,  which  hava  X) 
(x  thaaa  cohriaponding  value*  x) 


Nawwords  t  intagar;  (X  #  worda  uaar  add*  to  dixonary  X) 

KnownWorda  i  intagar;  (X  I  worda  of  intaxt  found  in  dixon.X) 

VariantWorda  i  intagar;  (X  |  variant  worda  of  intxt  in  dixonX) 

Struoturaworda  i  intagar;  (X  I  atuctura  worda  of  intxt  in  4dixx) 

UnknownWorda  i  intagar;  (x  t  intaxt  worda  not  found  anywharaX) 

Numfound  t  intagar;  (XI  total  worda  minua  unknown  word**; 

LtraParWord  t  ltraparwordtypa; 

WordaParSant  i  wordaparaanttypa; 

Av*_ltra_par_word  i  raal; 

Ava  wrda_.par.aant  i  raal; 

NumLatinata  i  intagar;  <x  raat  of  profila  ara  eounta  of  x)  ' 

NumOarmanic  i  intagar;  (x  worda  of  tha  intaxt,  found  in  X.v 

NumPoatGrad  t  intagar;  (X  tha . dictionariea,  which  hava  X) 

NumOrad  i  integer;  (x  thaaa  cohrtfaponding  value*  x) 

NumHigh.Sohool  <  intagar;  v 

NumEl amenta ry  i  intagar; 

NumTangibla  i  intagar; 

Numlntangibl#  i  intagar; 

NumSublima  >  intagar; 

NumPlaasant  i  intagar; 

NumNautral  i  intagar; 

NumUnplaasant  t  intagar; 

NumHorrid  i  intagar; 

NumViolant  i  intogar; 

NumEnergetic  i  intagar; 

NumCalm  i  intagar; 

Numlnart  i  intagar;  , 

and; 

Phraaatypa  •  packed  array  < . 1 . . phraaalength. )  of  wordtypa; 

(x  holds  tha  phraaaa  bracketing  tha  key  worda  of  concordanoa  line  X) 


REPORTER  MODULE  TYPES 


Langthtype  ■  (ToeShort*  Short/  Medium*  Long*  TooLong) 
Etymology typo  ■  (TooBorr awed*  Borrowed*  Mixed*  Native 
Modulationtype  *  (Bad*  Average*  flood); 

Runonatypa  ■  (Unacceptable*  Acceptable*  Nonexietant) ; 
Tangibilitytype  ■  (Soft*  firm*  Solid); 

Strengthtype  ■  (Weak,  Lively*  Strong*  VeryStrong); 
HardWcrdatype  ■  (Easy*  Challenging*  Hard*  VeryHard); 
Emotionelitytype  *  (Poor*  Standard*  Rich); 

Tonatype  «  (Negative*  Bland*  Positive); 

(X  Judgements  of  intaxt  style  Reporter  makes  based  on 

Anelysistype  ■  record 

WordLength  i  Langthtype; 

SentLength  i  Langthtype; 

Modulation  t  Modulationtype; 

Runons  t  Runonatypa; 

Etymology  i  Etymologytype; 

Tangibility  i  Tangibilitytype; 

Strength  i  Strengthtype; 

Hardwords  t  HardHordatype; 

Emotionality  i  Emotionelitytype; 

Tone  i  tonatype; 
and; 


TooNative) 


profile  x) 


GLOBAL  VARI 


x) 


(X 


Info  i  Uaar.Ihfotypa; 

Allswall  i  boolaan;  (X  global  flag  that  axaeution  oan  continua  x) 

(X  usad  instaad  of  a  GOTO  and  of  program  x) 

FourRoot,  (X  Root  of  tha  FourDix  BST  X) 

Root  i  EntryPointarj  <x  Root  of  tha  Dixonary  BST  x) 

Laatward  i  ConoordPointar;  (x  Pointar  to  last  word  road  X) 


Intaxt, 

Dixonary, 

FourDix, 

Concrdnc, 

NawWords, 

Raport  i  taxt; 


(X  fils  that  holds  usars  prosa  X) 

(X  dictionary  of  words  and  valuas  x) 

(x  diction,  of  short,  functional  words  X) 
(X  concordanca,  craatad  by  rasaarchar  X) 
(x  list  of  words  not  found  or  addad  X) 

(X  main  product  of  Raportar  x) 


Tally,  (X  Rasaarchar  scratch  pad  of  profila  X) 

Profila  t  profilatypa;  (X  Rasaarchar  input  to  Raportar  M) 

Analysis  i  analysiatypa;  (X  Judgamants  mada  by  raportar  x) 


(X  MAIN  PROGRAM  X) 

basin  (X  main  program  XJ 
IntarrogataJJsar; 
if  Allswall  than  Init_Raadarj 
if  Allswall  than  Init^Raaaarohar; 
if  Allswall  than  Init^Raportar; 
if  Allswall  than  Raad^Intaxt; 

If  Allswall  than  Calculata.Profila; 

If  Allswall  than  Analyza_Profila_and_Raporiw 
If  Allswall  than  if  Info ,Want_Concor3  than 
Nrita_Concordancaj 

If  Allswall  ^Ehan  if  Info .Want_Dix„Orow  than 
Stora_.NawJDix> 

If  not  Allswall  than 

Writaln  C'Exacution  tarminatad  until  you  olaar  that  up.'); 

and. 
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procedure  Intarrogatajlsar; 
var  c  i  char; 

good_intagar  i  boolaan; 
i>  j  .  intagar; 
bagin 
Paga; 

Writaln;  Writaln;  Writaln; 

Writaln  ('THE  STYLIST*  «49);  Writaln;  Writaln  ('by  IT  TC  Cool *.50) 
Writaln;  Writaln;  Wri-taln;  Writaln; 

Writaln  ('What  ia  tha  nama  of  tha  taxt  to  ba  analyzad? ' .65); 
i  i*0; 

whila  (i  <  linalangth)  and  not  (EOLN)  do  bagin 
i  i*  i  +  1; 

Read  (Info ,Nama_of_Taxt( . i . ) ) ; 
and;  Raadln; 

for  j  i*  i  +  1  to  linalangth  do  Info . Nama_of_Taxt( . j . )  .  =  1  ' ; 
Paga; 

Rapaat 

Writaln;  Writaln;  Writaln; 

Writa  Ola  ' ,  Info.Nama_of_Taxt); 

Writaln  ('  aavad  aa  "Inlfax?  taxt  a"T');Writaln; 

Writaln  ('  1)  Yaa  ».45); 

Writaln  ('  2)  No  1  .45); 

Raad  (o);  Raadln;  Writaln  (c.45); 

Until  ( c*  *1')  or  (c  *  *2'); 

if  (o  *  '1')  than  bagin  Writaln  ('That  ia  good. *>45); 

Allawail  i*  trua;  and 

alaa  bagin  Writaln  ('That  ia  too  bad. '.43); 

Allawail  t*  falaa;  and; 

if  Allawail  than  bagin 

Rapaat 

Paga;  Writaln;  Writaln;  Writaln; 

Writaln  (Info.Naffla_of_taxt.55) ;  Writaln; 

Writaln  (•  ia  '.45); 

Writaln;  Writaln; 

Writaln  ('1)  Fiction  '.45); 

Writaln  ('2)  Nonfiction' t 45); 

Writaln;  Writaln; 

Raad  (c);  Raadln;Writaln  (ci45>; 

if  <c  ■  '1')  than  Info.Oanra  •  ■  Fiction 

alaa  if  (c  *  '2')  than  Info.Oanra  i*  Nonfiction; 

Until  (c  ■  '1')  or  (c  ■  '2'); 

Rapaat 

Paga;  Writaln;  Writaln;  Writaln; 

Writaln  ('  REPORT  '.45);  Writaln; 

Writaln  PI)  Fraquantly' > 45) ; 

Writaln  C2)  Saldomly  '.45); 

Writaln  P3)  Whan  Dona  *  1 455 ;  Writaln; 

Raad  (o);  Raadln;  Writaln;  Writaln  (c);  Writaln; 
if  (c  *  '1')  than  Info . Fraquancy  «*  Fraquantly 
alaa  if  (c  ■  '2')  than  Inf o . Fraquancy  .a  Saldomly 
also  if  (c  »  '3')  than  Info. Fraquancy  .*  Whan_Dona; 

Until  (c  »  ' 1 ' )  or  (o  *  '2')  or  (o  ■  '3'); 


Rapaat 

Pag*;  Writaln;  Writaln;  Writaln; 

Writaln  ('  CONCORDANCE  '145);  Britain; 

Writaln  (•  1)  Y«a  '<43); 

Writaln  <•  2)  No  'i43); 

Road  <c);  Raadln;  Writaln  <c<43); 

if  (o  »  *1')  than  Info .Want_Concord  t"  trua 

alaa  if  (e  ■  '2')  than  Info. Want_Concord  <■  falaa; 

Until  ( c*  '1')  or  Co  ■  *2'); 

Rapaat 

Paga;  Writaln;  Writaln;  Writaln; 

Writaln  ( ' Approximataly  how  many  words  ara  in  ' ); 

Writaln  (Info.Nama.of^taxt,  •  T'>3); 

Info.aixa  ■  ■  0; 

Oood.intagar  i ■  trua; 
whila  not  EOLN  do  bagin 
Road  <e); 

if  (c  in  (.’O'. .'9'.))  than 

Info.sixa  i»  Info.siza  H  10  t  ord(c)  -  ord('O') 
alaa  bagin  good_intagar  i*  falaa;. 

Writaln  (c<2,  1  is  an  invalid  charactar. ' );  and; 

and;  Raadln; 

Uhtil  Oood„Intagar; 

Writaln  CInfo.ai«ai45>; 

Rapaat 

Paga;  Writaln;  Writaln;  Writaln; 

Writaln  (•  EXPAND  DICTIONARY  T  '>45);  Writaln;  Writaln; 
Writaln  (•  1)  Yaa  *<45); 

Writaln  («  2)  No  'i45); 

Rood  (o);  Raadln;  Writaln  (ct4S); 

if  (c  ■  '1')  than  Info .Want_Dix_Grow  i *  trua 

also  if  (o  *  *2')  than  Info. WantJ3ixJJrow« "falaa; 

Until  <o  «  *1')  or  (c  ■  '2'); 
and;  (*  if  allswall  *) 
and; 


Srocadura  InitJ?aadar; 

•sin 

Raaat  (Intaxi*  'Intaxt  taxt  a'); 
and; 


prooadura  Init_Raaaarchar; 
var 
i* 

atapoount* 
thraahhold* 
dixlanoth  i  intagar; 

•ntry  i  antrytypa; 
balanoad  i  boolaan; 
baoin 

Raaat  (Dixonary*  'Dixonary  taxt  a'); 

Rawrita(Nawworda,  'Mawworda  taxt  a'); 

if  Inf6.Want_Conoord  than  Rawrita  (Conerdnc*  'Conerdnc  taxt  a'); 
if  (Info.fraquancy  <>  WhanJ)ona)  than  baoin 
Dixlanoth  i *  0; 

whila  not  EOF  (Dixonary)  do  bagln 

Raadln  (Dixonary);  Dixlanath  < ■  dixlanoth  +  1; 
and; 

Raaat  (Dixonary/  'Dixonary  taxt  a'); 
and; 

eaia  Info . Fraquancy  of 

Fraauantly  i  thraahhold  ;■  fraqnum; 

Saldomly  i  thraahhold  * ■  aalnum; 

WhanJSona  i  thraahhold  t  “  HhanDonanunt; 
and;  (*  oaaaa  M) 

Naw  (Root); 

Load  (2,*  Root-); 

Root-.laft  i*  nil;  Root-. right  nil; 

Root- .down  i “  nil; 

Root-.balanca  «■  Zaro; 

Root-.atatua  i"  Storabla; 

Laatward  t"  nil; 

Stapcount  i *  0; 

whila  not  EOF  (Dixomry)  do  basin 
Load  (2*  Entry); 

Entry. balanoa  i»  Zaro;  balanoad  ■  *  falaa; 

AVL  Inaart  (Entry*  Root*  balanoad); 
if  (Info.fraquancy  <>  Whan_Dnna)  than  baoin 
atapcount  i*  atapoount  +  1; 
if  (atapoount  mod  thraahhold  “  0)  than  basin 
Pioa;  Writaln;  Writaln;  Writaln;  Writaln; 

Wri+aln  ('Stylist  is  now  loading  its  dictionary.  *  1 30 ) ; 
Writaln;  Writaln; 

Writaln  ('  Last  word  loadad  waa  ' 1 30 ) ; 

Writaln  ( a,>try .  word  t  AQ) ;  Writaln;  Writaln; 

Writaln  ('Stylist  has  loadad  ' iS0*wtapoounti5* '  antriaa.'); 
Writaln  ('out  of  a  dictionary  of ' i 3)* Dixlangthi 5* '  words.') 
Writaln;  Writaln; 

Writaln  (StapoountMlOO  div  Dixlanoth t 33*  '  X  oomplata. ' ); 
and;  (W  if  *) 
and;  (*  if  0) 
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with  ■'tally  do  basin 
totnllattara  »■  Oj 
totalworda  i*  0; 
totalaantancaa  »■  0; 
totalatrucltra  i»  Oj 
numwordathiaaant  i B  0 i 

for  i  i*  0  to  wordlangth  do  ltraparword< ,i .)  0/ 

for  i  i*  0  to  maxaant  do  wordaparaantC .1 . )  i*  0; 
KnownWorda  i ■  0 ; 

NawWorda  0; 

Numfound  i«  0; 

VariantWorda  t»  0; 

Structuraworda  t*  0; 

UnknownWorda  t«  Oi 
NumLatinata  n  0; 

NumGerinanic  i  »0 ; 

NumPoatGrad  » * 0 ; 

NumOrad  •  0; 

NuitiHieh^sohool  i*  0j 
Numclamantary  :«  Oj 
NumTanoibla  i«  0/ 

Suwlntangibla  i«  0/ 
umSublima  t"  0j 
umPlaaaant  i ■  Oj 
NumNautral  i*  Oj 
NumUnplaaaant  i*  Oj 
NumHorrid  i ■  Oj 
NumViolant  i *  Oj 
NumEn*.rgotio  i *  Oj 
NumCalrn  0J 


Numlnart  i ■  Oj 
id;  (*  with  tally  do  N) 

Raaat  (FourDix*  'FourOix  taxt  a'); 

New  (FourRoot);  FourRoot-. right  t»  nil;  Fo 

Load  (1*  FourRoot-) j 

Attach  (FourRoot-*  FourRoot); 

FourRoot-. balanoa  i «  Zaroj 
While  not  EOF  (FourDix)  do  bagin 
Load  (1*  Entry); 

Entry. balanoa  i*  2oroj 
balanoad  i*  falaaj 

AVL.Inaart  (Entry*  FourRoot*  balanoad); 
and; 


FourRoot-. laft 


procedure  Load  Cfilenum  t  intis 
ver 

aourceltr/  diffioultvltr* 
concretenessltr,  emotionltr, 


integer;  ver  entry  i  entrytype); 


vigorltr 
begin 
If  <filan 


cherj 


(filanum  ■  1)  then  Reedln  (FourDix,  Lntry.word, 

eourceltr,  difficultyltr, 
concretineeeltr,  emotionltr# 
vigorltr) 

(Dixonary>  Entry .word>sourcaltr,  difficultyltr, 

conoreten«issltr,  emotionltr, 
vigorltr); 


•lee  Reedln 


with  en+ry  do  begin 
if  (eourceltr  *  *1') 

case  difficultyltr  of 
* p •  i  difficulty  i 


then  source 
else  source 


.  - -  .  Postgrad; 

'o'  i  difficulty  i*  Grad; 

•M  i  difficulty  i ■  High_3chool; 

'a*  i  difficulty  i«  Elementary; 

end; 

if  (coneretanessltr*  1 1 *  J  then  concreteness  i ■  Tangible 

alee  concretenees  t*  Intangible; 


i>  Letinate 
i«  Germanic; 


case  emotionltr  of 
'a*  i  emotion  i 
'o'  t  emotion  i 
'n*  •  emotion  t 
•u*  i  emotion  i 
* h *  i  emotion  i 
end; 

case  vigorltr  of 
•v»  i  vigor  i« 

•  vigor  i“ 


* c #  »  vigor  i"  Calm; 

*  1 •  i  vigor  t*  Inert; 
end; 

end;  (M  with  entry  do  m) 
end; 


Sublime; 

Pleasant; 

Neutral; 

Unpleasant; 

Horrid; 


Violent; 

Energetic; 


procadura  AVC_Inaart  (Entry  i  antrytypa;  var  p  »  Entrypointar; 

var  balanced  i  boolaan); 
var 

pl*  p2  t  antrypointar; 
basin 

If  (p  ■  nil)  than  basin 
Attach  (Entry*  p); 

«-• . balanca  i ■  Zero; 
balancad  i*  true; 
and 

alsa  if  (Entry, word  <  p'.word)  than  basin 
AVL_.Inaert  (Entry*  P'.left*  balancad); 

If  Balancad  than  (X  laft  pointar  haa  grown  highar  X) 
caaa  P'. balance  of 

Plua  i  bagin  p'. balance  Zaro;  balancad  i*  falsa; 
Zaro  i  p-. balance  i*  Minus; 

Minus  i  basin  <x  rebalance  K) 
pi  i ■  p-.iaft; 

if  (pi1*, balanca  *  Minus)  than  begin(x  aingla 
P'.left  i«  pl^. right; 
pl'. right  p; 

P'. balance  t"  Zaro; 
p  •  *  p1; 
and  (x  if  X) 

alaa  bagin  (X  double  LR  rotation  X) 
p2  i ■  pi'. right; 
pl'. right  •“  p2',left; 
p2',laft  t •  pl i 
P'.laft  i«  p2', right; 
p2', right  i"  p; 

if  (p2', balanca  ■  Minus)  than  p', balanca 
alaa  p*. balanca  »■  Zaro; 
if  (p2-. balanca  ■  Plua)  than  pl'.balanoa 
alaa  pl'.balanoa  i *  Zero; 

P  i*  p2; 
and;  (X  alaa  X) 

P', balance  i«  Zaro;  balancad  >•  falsa; 
and;  (X  caaa  of  Minus  X) 
and;  (X  of  oases  X) 
and  (X  if  Entry, word  <  P'.word  X) 


and; 

LL  rotatM) 

i*  Plua 
i»  Minua 


•lac  if  (  Entry. word  >  p-.word)  than  begin 

AVL_Inaert  (Entry,  p-. right,  balanced) ; 
if  balanced  then  <x  right  pointer  haa  grown  higher  x) 
oaa*  p-, balance  of 

Minua  t  begin  P-. balance  i*  Zaro> 
balanced  t»  false;  end; 

Zero  i  p-, balance  i *  Plua; 

Plua  i  begin  (x  rabalanoa  X) 
pi  i«  p-. right; 

if  (pi-. balance  «  Plua)  then  begin 
(X  aingle  RR  X) 

P-. right  »■  pi-. left; 
pl-.left  i *  p; 

>  p->.  balance  ia  Zero; 

p  i a  pi; 
and  (x  if  X) 

alee  begin  (x  double  RL  rotation  M) 
p2  i»  pl-.left; 

Pl-.left  i«  pZ-. right; 
p2-. right  t»  pi) 

P-. right  *“  p2-.left; 
pZ-.left  i*  p; 

if  (p2-. balance  ■  Plua)  than 
P-. balance  i»  Minua 
•lae  p-. balance  i ■  Zero; 
if  (p2-. balance  •  Minua)  then 
pi-. balance  i*  Plua 
•lae  pi-. balance  t ■  Zero; 

end;<S>(x  double  RL  rotation  X) 

P-. balance  i*  Zero;  balanoed  i ■  false; 
end;  (X  caae  of  balance  *  Plua  x) 
and;  (X  of  caaea  X) 
end  (X  of  if  Entry. word  >  p-.word  X) 
else  balanced  i*  false; 
end;  (X  of  procedure  AVL_Inaert  X) 


procedure  InitJUporter; 
begin 

Rewrite  (Report,  'Report  text  •'); 

Writaln  (Report);  Writeln(Report) ;  Writeln(Report) ; 
Writaln  (Report,  Info  ,  Name_,of  .Text  1 45) ; 

Writaln  (Report,  • PROFILE' > 437; 

Writeln  (Report);  Writeln(Report); 

and; 
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procedure  Read_Intext; 
ver 
i* 

letternum* 
wordnum* 
thraahhold* 
stepcount* 
offset  i  integer; 
c  i  ohar; 
n  i  readtype; 
begin 

lettarnum  •*  0; 
wordnum  i *  0; 
stepcount  >a  0; 
case  Info . Frequency  of 

Frequently  i  thraahhold  i*  freqnum; 

Seldomly  i  thraahhold  i«  selnum; 

Whon„Done  i  thraahhold  i«  WhenDonenum; 

end;  (X  oeeea  X) 

offset  i«  ord  ('A')  -  ord  C  *  a • ) ; 
while  not  EOF  (Intext)  do  begin 

while  not  EOLN  (Intext)  and  Allawell  do  begin 
Read  (Intext*  o); 

if  (o  in  C . ' A '..'Z',))  then  o  »■  ehr(ord(e)  -  offset); 
if  (Co  in  (.'a'. .'a'.))  or  (e  «  Suce('9')> 

or  (o  in  ( . *0 ' . . *9* . )))  then  begin 
letternum  ■*  lettarnum  +  1; 

if  (lettarnum  <■  wordlength)  then  n.wordC .letternum. )  t*  o; 
and; 

if  (letternum  >  0)  then  begin 

if  (c  ■  '  ' )  or  (c  ■  '-')  or  EOLN  (Intext)  or 

(c  ■  ' . 11 )  or  (c  *  '?')  or  (a  ■  '!*)  or  (c  ■  ';')  then 
begin 

for  i  t«  letternum  +  1  to  wordlength  do  n.wordC .i . ) i *  *  1 
if  (letternum  >  wordlength)  then  n. length  is  wordlength 
else  n. length  i *  letternum; 

Store_.Intext  (n); 

Wordnum  i*  Nardnum  +  1; 
lettarnum  i«  0; 

if  (o  *  '.')  or  (c  ■  '?')  or  (c  *  '!')  or  (o  ■  ';')  than 
if  (wordnum  >  1)  then  begin  (X  not  embedded  command  x) 
Signal^End^of^Sentence; 
wordnum  i ■  0; 

end; 

atepoount  i*  stepcount  +  1; 
if  (stepoount  mod  thraahhold  *  0)  then  begin 
Page;  Writeln;  Writeln;  Writeln;  Writeln; 

Write  ('Stylist  is  now  reading  i  '); 

Writeln  ( Info. Name  of_text); 

Writeln  ('Last  word  read  was  '»  n.word); 

Writeln;  Writeln; 

Write  ('Stylist  has  read  '*  stepcount ■ 5) ; 

Writeln  ('  words.  ');Writeln> 

Writeln;  Writeln; 

Writeln  (StapcountxlOO  div  Info.SizetS*  '  X  complete.'); 
and;  (X  if  x) 
and; 

and;  (X  letternum  >  0  X) 
end;  Readln  (Intext); 
end;  (X  while  not  EOF  X) 

Signal„End_of_File; 

end; 


procedure  Store  Intext  (n  i  raedtype); 
var  cp  t  concordpointer; 
found  i  boolean; 
entry  i  entrytype; 
begin 

i*Hy‘t°t*}worde  i«  tally, totalwords  +  1; 
telly. totallettara  telly. totallattrrs  +  n,  length; 

telly, numwordsthiasent  i»  tally , numwordsthiasent  +  1; 
found  t»  felae; 

if  ^t^SS^d  th5i!  tH*n  L<)ok-Up-Four  <n'  FeurRoot,  found); 

R»o^} 1  v ■  tr-P.'Wordt .  n . l.n„th . ,  ♦  1, 

end;  ” 


S' 

y 

\ 
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procedure  Signal_End_of_Sentence; 
begin 

*"  ■k^llV'- totalsantenoea  +  1; 

>  maxaant)  then  begin 

end  n  ('INTEXT  TOO  BIO.1);  Allewell  i«  felae; 

*  tally,  totalsentences . )  i  ■ 
telly. numwordathiaeent; 
telly, numwordethieaent  i*  0; 
end; 

procedure  Signel_End.of_.File; 
begin 

if  Info. Went.Concord  then  Lestward-.naxt  i ■  nil; 
tel ly , numr ound  *“  tally . totalword*  -  tally, unknownwords; 
£a?;;  Writeln;  Writeln;  wn*nownwo™ 

Styliet  hae  finiehed  reading  i 
Writeln  (Info, Name  of  text); 
end; 


| 


I 


procedure  Look.llp.Four  (n  i  readtype;  vor  ancestor  i  entrypointer; 

found  i  boolean); 

var  cp  i  concordpointer; 

entry  i  entrytype; 
begin 

if  (ancestor  *  nil)  then  found  i*  false 
else  if  (ancestor*' .word  ■  n.word)  then  begin 
found  i*  true;  * 

if  (ancestor-. vigor  <>  inert)  then  AddJ/alues  (ancestor); 
tally. totelatrucltrs  i*  telly . totelstrucltrs  +  n. length; 

J jl ly . *t|*uctureworde  tally .struotureworda  +  1; 

if  Info , Want.Concord  then 

end  (^^c^Concor<:,-Nocl«  (ancestor#  cp); 

else  if  (ancestor-. word  >  n.word)  then 

Look  Up  Four  (n,  ancestor- . left ,  found) 

•  ls«  Look.Up^Four  (n»  •ncMtor^.riflht#  found); 

•nd  j 


R 


r 

1 


WIV *1* 


VSHim 


W-WJWjl 


iW, 


vy*  y*vv  vx 


£ 


procedure  Look-Up  (n  t  raadtype;  var  ancestor  t  antrypointar); 
var  cp  t  concordpointar ; 
ontry  i  entrytypa; 
pi  antrypointar ; 
found  i  boolean; 

>c  i  char; 
tiffin 

if  (ancestor  *  nil)  than  begin 

Look_Up  Variants  (n,  p,  found); 
if  found  than  begin 

tally. variantwords  i=  tally, variantwords  4-  1; 

New  (ancestor); 
with  ancestor-'  do  basin 
word  i*  n.word; 
left  ia  nil;  right  ia  nil; 
down  i "  nil ; 

end;  (X  with  ancestor  do  JO 
if  (p-. status  O  not_valuable)  than  basin 
Add.Valuas  Cp); 
with  ancestor"  do  bagin 
status  ia  not.Storablo; 
source  p->. source) 
difficulty  i=  P" , difficulty; 
conoratanass  t  =  p-. concreteness; 
emotion  t=  p-> amotion; 

;i  visor  i»  p-, vigor; 
and;  (X  with  ancestor  do  x) 
end  else  (x  p-. status  is  other  then  not_v*iuible  x) 
ancestor- .status  i=  not.velueble; 

end 

else  if  I/i  fa  .Want_Dix_Grow,  than  begin 

Repeat 

Pace;  Wniteln;  Writaln;  Writaln; 

Writaln  k'n.wordi  50,);  Writaln;  Writaln; 

Writaln  ( ’ADD  ITY ?  *  1 45) ;  Writelh;  Writaln; 

Writaln  ( ’l)  Yes' i 45); 

Writaln  ('2 )  No  •  i<*5); 

Raad  Cc);  Readln; 

Until  (c  a  '  1')  or:i<o  *  !2* ); 
if  (c  3  '1*)  than,  begin 
Entry. word  i*  h.WOrd; 

Get_Valuas  for^awifn+ry  (Entry); 

Attach  (Entry,  pmcastor); 

Tally . nawwords  i  =  tally . pewwords  +  1; 

Add_Veluas  (ancestor)!; 
and  alse  Attach_No_Valu,fe  (ancestor,  n); 

and  .7 

alsa  Attach_No_Valua  (ancestor,  n); 
if  Info .Want_Concord  than 

Attech_ConcordTNoda  (ancestor,  cp); 
and  (X  in  ancestor  »  nil  X) 
alsa  if  (ancestor- .word  s  n.word)  than  begin 

if  (ancastor- .status  <>  not  storable)  than 

tally . knownwords  i*  telly . knownwords  +  1 
else  tally. unknownwords  •»  tally. unknownwords  +  1; 
if  (ancestor- .status  <>  Not_Valuable)  than 
Add_ Values  (ancestor); 
if  Info.Want_Concord  than 

Attach_Concord_Node  (ancastor,  cp); 
and  (X  if  X) 

alsa  if  (ancestor- .word  >  n.word)  than 
look_Up  (n,  ancestor-* .left) 
else  Look_Up  (n,  ancestor- . right) ; 
and; 


procedure  Attach  (Entry  i  entrytypa;  var 
basin 
Naw  (p); 

with  p-  do  bagin 

word  i*  entry. word; 
status  t*  storable; 
sourca  i*  antrv .source; 
difficulty  «■  antry. difficulty; 
concratanaas  t*  antry . concratanaas; 
amotion  »■  antry. amotion; 
vigor  <“  antry. vigor; 
laft  ia  nil;  right  i»  nil; 
down  ia  nil; 
and; 
and; 


antrypointar); 


prooadura  Attach_Concord_Node  (p  i  antrypointar; 

var  cp  i  concordpointar) ; 

var  Bottom, 

Bottommost  i  concordpointar; 
basin 

Bottom  i ■  p-.down; 
whila  (Bottom  <>  nil)  do  bagin 
Bottommost  i *  bottom; 

Bottom  i®  Bottom-. down; 
and; 

Naw  (cp); 

With  op-  do  bagin 
Up  i=  p; 

if  (Laatward  »  nil)  than  Last  »■  nil 
alsa  last  is  Lastward; 

Down  t«  nil; 
and; 

if  (P-,down  «  nil)  than  p-.down  <■  cp 
also  Bottommost-. down  i«  cp; 
if  (Lastward  <>  nil)  than 
cp-. last-. next  i»  cp; 
lastward  i»  op; 
and; 

procadura  Attach_.No_Value  (var  ancestor  t  antrypointar; 

n  i  raadtypa); 

bagin 

Writoln  (nawwords,  n.word); 

Naw  (ancaator);  Ancastor-.word  «■  n.word; 
tally. unknownwords  »■  tally . unknownwords  +  1; 
Ancestor- . right  i«  nil;  Ancaator- . laft  i*  nil; 
Ancestor- . down  i*  nil; 

Ancestor- .status  i *  not_val uabla; 
and; 


i V»ViV»v  »>  u 


procadura  Look_Up-V»ri«nt»  (original  i  raodtypa;  var  p  i  antrypointar > 

var  found  >  bool  tan); 

var  variant  i  raadtypa; 

i  i  intagar; 
bagin 

found  )■  falsa/ 

if  (original .langth  >  3)  than 

if  (original ,word( .original . langth. )  ■  'a')  than  bagin 
variant, word  i“  original .word/ 
variant ,word( , original , langth. )  i«  '  '/ 
variant,  langth  i“  original .  langth  -  1/ 

LookJJp_,Variant  (variant,  Root,  found,  p); 

if  not  found  and  (variant. word? .variant. langth. )  *  'a')  than 
bagin 

variant, word? .variant. langth. )  ■■  ’  ’/ 
variant , langth  i*  variant . langth  -  1; 

Look-Up  Variant  (variant,  Root,  found,  p)/ 

if  not  round  and  (variant. word( .variant , langth. )  «  *i')  than 

bagin 

variant. word? .variant, langth. )  iB  'y'/ 

Look.JJpJ/ariant  (variant.  Root,  found,  p); 
and; 


alaa  if  (original , langth  >  4)  than 

if  (original .word? .original .langth, )  ■  'g')  and 
(original .word? .original. langth  -  1.)  ■  'n')  and 
(original .word? .original .langth  -  2.)  ■  *i')  than  bagin 
variant, word  i»  original .word; 

for  i  i*  original . langth  -  2  to  original . langth  do 
variant. word? ,i . )  t“  *  '/ 
variant. langth  i»  original. langth  -  3; 

Look_Up  Variant  (variant, Root,  found,  p); 
if  not  round  than  bagin 

variant. langth  i*  variant. langth  +  1; 
variant. word? .variant. langth.)  «■  'a'; 

Look_Up  Variant  (variant,  Root,  found,  p); 
if  not  found  and 

(variant .word? .variant, langth  -  2.)  * 
variant .word? .variant. langth  -  1.))  than  bagin 
variant . langth  ■■  variant . langth  -  2; 
variant .word( .variant . langth  +  1.)  i«  ' 
variant. word? .variant, langth  +2.)  i*  '  ’/ 
Laok„.Up_,Variant  (variant,  Root,  found,  p); 

and; 

and; 

and 


end; 

•nd; 


if  (original , langth  >4)  than 

if  (original .word( .original .langth.)  ■  * d' )  and 

(original .werd< .original . lansth  ”1.)  ■  '•')  than  bagin 
variant, word  i*  original .word)  .  . 

for  i  *■  original , length  -  1  to  original. langth  do 
variant. word( ,i . )  •*  *  1 ) 
variant, langth  i"  original. langth  -  2) 

Look_Up  Variant  (variant# Root/  found/  p); 
if  not  round  than 

if  (variant, word( .variant .langth. )  *  * i * >  than  bagin 
variant.word( .variant. langth. )  i*  'y') 
variant. langth  »*  variant . langth  -  1; 

Look_Up_Variant  (variant/ Root/  found#  p); 
and) 

if  not  found  than  bagin 

variant . langth  < ■  variant. langth  +  1) 
variant ,word( .variant . langth. )  i»  'a'; 

Look_Up  Variant  (variant#  Root#  found#  p>) 
if  not  round  and 

(variant .word( .variant, langth  -  2.)  « 
variant. word( .variant, langth  -  1.))  than  bagin 
variant, langth  i»  variant . langth  -  2) 
variant.word(,variant.langth  +  1.)  t«  '  ') 
variant. word( .variant . langth  ♦  2.)  »**  *  ') 
LookJJp_Variarit  (variant#  Root#  found#  p>) 

and) 

and) 

if  (original. langth  >  4)  than 

if  (original .word( .original . langth. )  a  'y')  and 

(original ,word( . original . langth  -  1.)  ■  *  1  * )  than  bagin 
variant, word  i*  original .word) 

for  i  i*  original .langth  -  1  to  original. langth  do 
variant. word( .i . )  i«  *  ') 
variant. langth  i*  original .langth  -  2) 

Look-Up Variant  (variant# Root#  found#  p)) 
if  not  found  than  bagin 

variant. langth  i»  variant. langth  t  2) 
variant. word< .variant. langth.)  *a') 
variant. word< .variant .langth  -  1.)  i*  '1') 

Look_Up  Variant  (variant#  Root#  found#  p)) 
if  not  round  and 

(variant .word( .variant. langth  -  2.)  »  * i * )  than  bagin 

variant. langth  »“  variant . langth  -  2) 
variant.word( , variant . langth . )  i"  'y') 
variant. word( .variant. langth  +  l.J  ■*  '  ') 
variant, word( .variant. langth  +  2.)  i»  *  ') 
Look_Up_Variant  (variant#  Root#  found,  p)> 

and) 

and) 


procadura  Look_Up..Variant  (variant  i  raadtypaj 

v«r  ancestor  i  Entrypointar; 
var  Pound  t  Boolean; 
var  p  »  Entrypointar); 

if  (ancestor  ■  nil)  than  found  t«  falsa 
alsa  if  (ancaator-.word  «  variant, word)  than  basin 
found  i»  trua; 

P  ancestor; 
and 

alsa  if  (ancaator-.word  >  variant. word)  than 
A ,  ^?L<-Hp_Variant  (variant,  ancestor-,  left,  found, 

alsa  Look_Up_Variant  (variant,  ancaator-, rioht,  found, 
ana; 


p) 

p) 


procedure  AddJ/aluas  (p  t  antrypointar); 
basin 

if  (P7.aourca  «  Latinate)  than 

Tally.Numlatinata  .«  Tally. NumLatinata  4  1 
alsa  Tally. NumOarmanic  t«  Tally ,  NumGeriwanic  +  1/ 
oaaa  p-. difficulty  of 

PoatGrad  <  Tally. NumPoatGrad««  Tally. NumPostGrad  + 

Pro-a  i  Tally. NumGrad  t«  Tally . NumGrad  4  1; 

Tally. NumHish_School  .■  Tally , NumHigh_School  +  1 
Tally. NumElamentary  i»  Tally .NumElamantary  +  1; 


1; 


High_School 
Elamantary 
and; 

if  (P-.cpnor*tanaaa  «  Intansibla)  than 
i  Tally. Numlntansibla  i«  Tally .Numlntansibla  4  1 
•1*»  Tally. NumTansibla  Tally. NumTansibla  4  1; 
oaaa  p-, emotion  of 

iblima  .  Tally. NumSublima  .■  Tally. NumSublima  + 
Tally . NumPleaaant  t -Tally . NumPleaaant 
Tally, NumNautral  i«  Tally. NumNautral  4 
Tally. NumUnpl aasant  i *  Tal ly . NumUnpl aaaant 
Tally. NumHorrid  Tally. NumHorrid  4  1; 


Sublima 
Plaasant 
Neutral 
Unplaaaant 
Horrid 
and; 

casa  p-. vigor  of 
Violant 
Enargatic 
Calm 
Inart 
and; 
and; 


1; 

+  1; 
1; 


Tally. Numvioiant  ta  tally. numviolant  4  lj 
Tally. Numtnarsatio  .■  Tally .NumEnargatic  4 
Tally. NumCalm  i»  Tally . NumCalm  4  1; 

Tally. Numlnart  i*  Tally . Numlnart  4  1; 


+  1; 


1; 


procedure  Gat,_Values_for_New_Entry 
var  c  i  char; 

flOod_char  i  boolean; 
begin 

Repeat 

Writeln;  Writeln;  Writeln; 

Writeln  ('SOURCE  • » 45 ) ;  Britain; 
Writeln  (Entry, wordi45); 

Writeln;  Writeln; 

Writeln  Cl)  LATINATE  '<45); 
Writeln  C2)  GERMANIC  'i45); 

Read  (c);  Writeln  (ci45);  Reedln; 
Until  ( c  ■  '1')  or  (o  *  '2'); 
if  (c  a  '1')  then  Entry, source 
else  Entry.source 

Repeat 

Writeln;  Writeln;  Writeln; 

Writeln  ('DIFFICULTY  LEVEL 
Writeln;  Writeln; 

Writeln  (Entry. word<45);  Writeln; 
Writeln  Cl)  POSTGRADUATE' r45); 
Writeln  C2)  GRADUATE  *  1 45); 
Writeln  C3)  HIGH  SCHOOL  •  * 45) ; 


Entry  t  entrytype); 


Writeln; 


Letinate 
ia  Germanic; 


1 45); 


Writeln; 


Writeln  C4)  ELEMENTARY 
Read  (o);  Writeln  (ei45); 
Until  <o  ■  '1')  or  (c  »  '2') 
case  c  of 


1 45) ; 
Reedln; 
or  <c  ■ 


Entry . difficulty 
Entry, difficulty 
Entry. difficulty 
Entry. difficulty 


PoetGrad; 

Grad; 

High.Sohoel; 

Elementary; 


Writeln; 

*>45);  Writeln;  Writeln; 
Writeln;  Writeln; 


•  1  • 

'2* 

'3  • 

'4' 
and; 

Repeat 

Page;  Writeln;  Writeln; 

Writeln  ('CONCRETENESS 
Writeln  (Entry, wordi45); 

Writeln  Cl)  TANGIBLE  » *45); 
Writeln  C2)  NOT  TANGIBLE  *  1 45) ; 
Raad(c);  Reedln;  Writeln;  Writeln 
Until  (o  *  '1')  or  (c  *  *2'); 
if  (c  ■  '1')  then  Entry. concreteneee 
else  Entry, concreteness 

Repeat 

Page;  Writeln;  Writeln;  Writeln; 
Writeln  ('EMOTIONAL  CONNOTATION  * 
Writeln  ( Entry. word* 45 )j  Writeln; 


Writeln 
Writeln 
Writeln 
Writeln 
Writeln 
Raad(c) ; 
Until  (c  » 
case  c  of 


Cl) 
C2) 
(  '3) 
C  *  4 ) 
C5) 


Reedln; 


SUBLIME 
PLEASANT 
NUETRAL 
UNPLEASANT 
HORRID 


Entry. emotion 
Entry. emotion 
Entry. emotion 
Entry. emotion 
Entry. emotion 


Rapaat 

Pasa;  Writaln;  Writaln;  Writaln; 

Writaln  < 'VIGOR' i45)j  Writaln;  Writaln; 
Writaln  (Entry. wordt45);  Britain;  Writaln; 
Writaln  ('1>  VIOLENT  •  «4S) ; 

Writaln  C2)  ENERGETIC  e<45); 

Writaln  CS)  CALM  '»45 ); 

Writaln  C4)  NONE  'i45>; 

Rand  (c);  Writaln  C o > 45 ) ;  Raadln; 


Until  ( c 
easa  c  of 
•1*  « 
'2'  i 

'  3 '  . 

•  4*  , 

and; 


e  ■  *  1  * )  or  (c  ■  '2’)  or  (c 
of 

i  Entry. vigor  «■  Violant; 
i  Entry. vigor  i «  Enargatic; 
i  Entry. vigor  i *  Calm; 
i  Entry. vigor  t*  Inart; 


)  or  <c 


procadura  Writa^Concordanca; 
basin 

Paga;  Writaln;  Writaln;  Writaln;  Writaln; 

Writa  ('Stylist  la  now  writing  tha  concordanca  for  '); 

Writaln  (Info,Nama..of_taxt); 

Writaln;  Writaln; 

Writaln  ('Concordanca  will  ba  writtan  to  "Conerdne  taxt  a"'t50); 
Writaln;  Writaln; 

Writaln  (Conordnc);  Writaln  (Conerdne); 

Writaln  (Conerdne,  'CONCORDANCE' i 53 ) ; 

Writaln  (Conerdne,  Info.Nama  of_taxti55); 

Writaln  (Conerdne);  Writaln  (Conerdne); 

Inordar.Coneordanca  (Root); 
and; 


procedure  Inorder_Coneordance  (var  Root  i  antrypointer); 

(X  Thi®  i«  tho  moat  complex  procedure  in  the  program.  It  traverses  X) 

(*  the  Dixonerv  Binary  Search  Tree  (B$T)  in  inorder  fa*hion,  this  X) 

(X  visiting  each  important  word  usad  in  the  Intext  in  alphabetic  X) 

(X  order.  During  the  visit  to  each  word*  it  goes  down  the  concord  X) 

(X  list  one  at  a  time*  thus  examining  each  use  of  that  word  in  the  x) 

(X  order  it  was  used  in  the  Intaxt,  At  each  visit  down  the  ooncord  X) 

(x  list)  it  follows  the  Lastward  linked  list  to  find  the  phrase  in  X) 

(X  the  Intext  that  preceded  that  instance  of  the  use  of  the  word;  x) 

(X  prints  the  phrase* capitalizes  and  prints  the  word*  then  follows  x) 

( x  the  Naxtward  linked  list  to  find  and  print  the  phrase  following,  x) 

var  Lastward,  <X  pointer  to  word  previous  in  intext  X) 

Naxtward,  <x  pointer  to  word  following  in  intoxtx) 

Downward  >  concordpointar ;  (X  pointer  to  next  use  of  same  word  X) 

Phrasal*  (X  phrase  preceding  ke,  word  in  concordx) 

Phrase2  ■  Phrasetype;  (X  phrase  following  key  word  in  concordx) 

w*  (X  9  of  word  being  printed  X) 

1*  <x  t  of  latter  in  word  being  printed  x) 

offset* 
i  i  integer) 
o  i  ehsrj 
begin 

offset  i"  ord  PA’)  -  ord  <*e')) 

If  (Root  <>  nil)  then  begin 

Incrder  Concordance  (Root-, left)) 

Downward  t*  Root". down) 

While  (Downward  <>  nil)  do  begin 
w  i  ■  0) 

Lastward  t»  Downward". lest) 

while  (Lastward  <>  nil)  and  (w  <  Phreaelength)  do  bogin 
w  i  ■  w  +  1 ) 

PhraseK.w.)  i*  Lastward". up". word; 

Lastward  i*  Lastward" . last; 
end; 

for  i  i •  w  downto  1  do  begin 
1  i *  l; 

while  (1  <■  wordlength)  do  bogin 
if  ( Phrasal ( . i , ) ( , 1 , )  <>  •  ”)  then 

Write  (Conordno*  PhrasaK .i . )( .1 . )); 

1  i*  1  +  1; 

end; 

Write  (Concrdno,  '  * il); 
end: 

Write  (Concrdnc*  '  •); 

1  i*l; 

while  (1  <*  wordlength)  do  bogin 
c  i“  Root" . word( . 1 . ) ; 
if  (o  in  (.'a'..'*'.))  then 
e  i*  chr(ord<c)  +  offset); 
if  (Root"  word(.l.)  <>  •  ’)  than 
Writ#  (Conordno,  c); 

1  i*  1  +  1; 

•rid? 

Writs  (Concrdnc,  '  '); 

w  i  *  0; 

Naxtward  n*  Downward" . naxt; 
whila  (Naxtward  <>  nil)  and 

(w  <  Phraaelength)  do  bagin 
w  i=  w  +  1; 

Phrase2(.w.)  i»  Naxtward- . up" .word; 

Naxtward  t*  Naxtward- . next; 
and; 


for  i  i *  1  to  w  do  bogin 
l  i«  U 

whila  (1  <■  wordlanoth)  do  basin 

if  <Phraaa2< .A . ;( . 1 . )  <>  '  ')  than 
Writ*  (Concrdnc,  ,nhra»a2<  ,i . )( ,  1 , ) ); 
J.  * •  1  li 
and; 

Writ#  (Concrtrtc/  '  ' • 1 ) J 

and; 

Wri+aln  (Concrdnc); 

Downward  i*  Downward-' . down ; 
and; 

In ardor^Concordanco  (Roo^-<. right) ; 
and i  (K  i?  Root  <>  nil  *) 
and;  (X  procaduro  Inordar_Conoordanca  X) 


it 

It 

C« 

JJ) 

It' 


procadura  Caloulaia  Profila; 

var  numaioltrs  i  intapar;  (x  numbar  of  aionifioant  lattara  x) 

(X  iSUKS2wfi5*iwint*!*rja  «»<  of  aiSnifW  i!f2  S) 

bapin  *^*B  oontaxt,  atructuraal  worda  ara  inaienificant  *0 

?S  ■  0)  than  tally,  totalaantancaa  i  ■  1; 

if  ( tally. tot. 1 worda  «  0)  than  tally, totalworda .5 l;  *' 

numaigwrda  i»  tally, totalworda  -  tally, atructuraworda; 

ProfilS1^Btally!ilV,t0^*liBtt*rB  "  tllily * t°*«l*tnucltra; 
profila,Ava_ltra..par_word  i ■  nufflaiflltra/numaiowrda; 
^oProfila.Ava_wrda„par-«ant  .«  tally. totalworda/tally. totalaantancaa; 


(x  main  Raportar  procadura  x ) 


Sa°i*dUra  An*ly2*-Profil<»..*,^..Raport;  _  . 

*&?!'  Writaln;  Writaln; 

KliJii  i#  n2Vn!iyzinB  th<  ■tvi«  ot 

Writaln  C lnfo,Nama_of-,taxt); 

Writaln;  Writaln; 

WritJln;(wEi?2lJ;°f  in*ly*la  wil1  b*  writtan  to  "Raport  taxt  a”'i90)j 

Mr:f n  < R«Por*t> ;  Writaln  (Raport); 

Writaln  (Raport);  Writaln  (Raport); 

Calculata_Si*a  of^Taxt  (profila,  onalyaia); 

Calculata  Lang:t:h_of ^Worda  (profila,  analyaia); 

Caloulaf  *  I  annfK  _ _  ,  I _ ij  “  ,  s  . 

analysis); 


u,1  ou *•*•-»* flor„of_1Worda  (profil 
MaKa.Racomwandationa  (analyaia); 
and; 


procadura  Calculataj5i*a_of_Taxt  (profiia  i  profilatypa; 

var  analysis  i  analyaiatypa) ; 

var  i* 

ParCantFound  i  intagar; 
bagin 

Writaln(Raport);  Writaln(Raport) ;  Writaln(Raport); 

Writ*  (Raport*  'Total  of  aantanoaa  i  '); 

Writain  (Raport,  profiia , Totalaantancaa) ; 

Write  (Raport*  'Total  of  worda  t  '); 

Writain  (Raport*  profile.Totalworda); 

Writa  (Report*  'Total  of  iattara  i  '); 

Writain  (Report,  profile. totailattera); 

Writain  (Report);  Writeln(Report) ;  Writeln(Raport); 
if  (profile. totalworda  <  300)  than  bagin 

Writa  (Raport,  Profila.totalworda,  '  worda  ara  '); 

Writain  (Report,  'too  faw  for  valid  atatiatioal  analyaia.'); 

and 

alaa  if  (profiia . totalwords  <  1000)  than  bagin 

Writ#  (Roport,  Profila.totalworda*  '  worda  ara'); 

Writain  (Raport*  '  enough  for  valid  atatiatioal  analyaia.'); 
and 

alaa  bagin 

Writa  (Report*  Profiia. totalworda*  '  worda  ara  '); 

Writain  (Report*  '  planty  for  valid  atatiatioal  analyaia.'); 
and; 

Writain  (Raport);  Writaln(Raport);  Writoln(Raport) ; 

Writa  (Raport*  'Of  tha  '*  profiia. totalwordaii*  '  worda  in  '); 
Writain  (Raport*  Info.Naffla  of_.Taxt*  *,'); 

Writa  (Raport*  profila.numfoundti); 

Writain  (Raport*1  wara  matched  to  worda  in  tha  Styliat  dictionary.'); 
paroantfound  i«  profiia . numfound  M  100  div  profiia. totalworda; 

Writain  (Raport); 

Writain  (Raport*  Paroantfound i 2,  '  X  wara  matched. '); 
if  (paroantfound  <  SO)  than  bagin 
Writa  (Raport,  Inf o.Name_of  text); 

Writain  (Raport,  '  muat  contain  many  apaoialty  or  unique  worda.'); 
Writain  (Raport*  'Statiatioal  analyaia  ia  not  valid.'); 

and 

alaa  Writain  (Raport*  'Thla  ia  anough  for  atatiatioal  analyaia.'); 
if  (profiia. nawworda  >  0)  than  bagin 

Writa  (Report,  'You  added  ',  profiia. nawworda); 

Writain  (Raport*'  to  tha  dictionary  during  this  aaaaion.'); 
and; 

and;  (*  procadura  Calculata  Size  of  intaxt  X) 


procedure  Calculate_Length_of_Words  (profile  i  profiletype; 

v«r  analysis  >  analysisty pe); 

basin 

Graph,J.traParWord  (profile. ltraparword) ) 

Writaln  (Report))  Writaln  (Report)) 

Write  (Report,  'The  avarage  number  of  letters  per  word  i  '); 
Writaln  (Report,  profile.  Ave._ltra_par_.WQrd)) 

Writaln(Report) )  Writeln( Report ) ) 
if  (Info.Oenra  ■  NonFiction)  than  begin 

Writaln  (Report,  'A  typical  nonfiction  taxta  distribution  • ) j 
Write  (Report,  'resembles  a  low  bell-shaped  curve  ')) 

Writaln  (Report,  'centered  around  six  latters/word. ' ) ; 
if  (profila.Ave  LtrsPer_Word  >  7)  then 
Analysis. WordLength  i*  Toolong 
else  if  (profila.Ave  Ltrs_Par_Word  >  6.5)  then 
Analysis. WordLength  i 3  Long 
else  if  (profila.Ave  Ltrs_Per  Word  >  5.5)  then 
Analysis. WordLength  i*  medium 
else  if  (profile. Ave_Ltrs_Per_Word  >  5)  than 
Analysis. WordLength  >■  short 
else  Analysis. WordLength  ■  ■  tooshort 

end 

else  begin  (X  genre  is  fiction  X) 

Writaln  (Report,  'A  typical  fiction  texts  distribution  ')) 
Write  (Report,  '  resembles  a  tall  bell-shaped  curve  '); 
Writaln  (Report,  'centered  around  five  letters/word. ' )) 
if  (profila.Ave  Ltra  Per„Word  >6)  then 
Analysis. WoraLength  ■*  Toolong 
else  if  (profile.Ave_Ltrs_Per_.Word  >5.5)  then 
Analysis .WordLength  i*  Long 
else  if  (profila.Ave  Ltrs_Per  Word  >  5)  than 
Analysis. WordLength  t«  medium 
else  if  (profile. Ave_Ltrs_Par_Word  >  4)  than 
Analysis. WordLength  t*  short 
else  Analysis. WordLength  i“  tooshort) 
end) 

Writaln  (Report.)) 

Write  (Report,  'The  length  of  the  words  is  ')) 
case  Analysis. WordLength  of 

ToqShort  i  Writaln  (Report,  'too  short.')) 

Short  i  Writaln  (Report,  'short.')) 
medium  i  Writaln  (report,  'medium')) 

Long  i  Writaln  (Report,  'long.')) 

Toolong  i  Writaln  (Report,  'too  long.')) 

Writaln  (Report))  Writeln(Raport))  Writaln(Report) ) 
end;  (x  oalculata  length  of  words  x) 


r»rocadura  Graph_ttrapapword  (Itraparword  •  ltraparwordtypa); 
var  parJtrworde  i  itraparwordtypa; 

i»  i  i  intagar; 
basin 

Rasa  (Raport); 

Writaln(Raport);  WritalnCRaport);  WritalntRaport) ; 

Writaln  CRaport, 'BREAKDOWN  OF  PERCENT  OF  LETTERS  PER  W0RD'i60); 
Writaln<Raport>;  Writaln<Raport); 
for  i  i *  0  to  wordlangth  do 

ParltrWordaC . i . 1  t  *  ( LtraParWord( . i . )*100 )  div 

( tally . totalworda  -tally .atructurawords) ; 
far  i  i*  15  downto  1  do  bagin 
Writ*  (Raport,iX2<2,  *  ' >4); 

fop  j  i°  1  to  wordlangth  do 

if  <ParLtrWords< . j . )  >■  i*2)  than 
Writa  (Raport, '  X  • i 5) 
alaa  if  (ParLtrWordat . d . )  >•  (i*2)-l)  than 
Wpita  CRaport,'  *i5) 

alaa 

Wpita  CRaport,'  *  *  5 ) ; 

Writaln  CRaport); 
and; 

Wpita  CRaport, '  * i4); 

fop  i  i «  1  to  wopdlangth  do 
Writa  (Raport, i i 5) ; 

Wpitaln  (Raport); 

Writaln  CRaport, 'Numbar  of  Lattars  in  a  Word'i45); 
and; 


procedure  Cslculste_Lsngth_of..5sntsncss  (prefile  t  profiletyp*; 

var  analysia  i  enalysistype) ; 


var 

i> 

Sweetapots, 

Runons, 

PerCentRunona, 

ParcentSweetSpots  i  integer; 
begin 

3raph_WordaPerSent  (profile. WordsPerSent) ; 

Write  (Report,  'The  average  number  of  words  per  sentence  ■  '); 

Writeln  (Report,  profile, Ave_wrds_per  sent); 

Writeln(Report) ;  Writaln(R*port);  Writeln(Report); 

Write  (Report,  'A  typical  modern  texts  sentences  average  '); 

Writeln  (Report,  '  between  fifteen  and  twenty  words.'); 
if  (profile. a ve_wrds,„pe resent  >  22)  then 
Analysis . SentLength  t«  Toolong 
else  if  (profile, ave_wrdB„per_sant  >  18)  then 
Analysis .SentLength  i*  Long 
else  if  ( prof i la, ava_wrdaMper  sent  >  15)  then 
Analysis. SentLength  »■  Medium 
else  if  (profile, ava_wrds_per_aent  >  10)  than 
Analysia. SentLength  i *  Short 
else  Analysis. SentLength  i“  TooShort; 

Writeln  (Report); 

Write  (Report,  'Sentences  are  '); 
oase  analysis. SentLength  of 

TooShort  i  Writeln  (Report,  'too  short.'); 

Short  i  Writeln  (Report,  'short.'); 
madium  i  Writeln  (report,  'medium'); 

Long  i  Writeln  (Report,  'long.'); 

Toolong  t  Writeln  (Report,  'too  long.'); 
end; 

Sweetspots  i«  0; 

Runons  i»  0; 

for  i  i *  1  to  profile. toteleentences  do 

if  (profile ,wordspersent( .i . )  >  45)  then 
runons  i *  runons  +  1 

else  if  (profile.wordspersent( .1 . )  >  8)  end 

(profile. wordspersentC .i . )  <  20)  then 
sweetspots  t»  sweetspota  +  1; 

PerCentRunons  i»  runensXlOO  div  profiia . totalsentancas; 
if  (ParoantRunOna  >  5)  than  Analysia . Runons  i*  unaooaptabla 
alas  if  (PeroantRunOns  >  0)  than  Analysis. Runons  t*  acoaptabla 
alse  Analysis. Runons  i*  nonaxiatant; 

Writaln  (Report); 

Writeln  (Report,  'Number  of  run  ons  ',  runonei4); 

Writeln  (report,  'Percent  of  run  ons  percentrunons t 4) ; 

Writeln  (Report); 

Write  (Report,  'Run  ona  are  ')/ 
case  Analysis . Runons  of 

Nonexistent  i  Writaln  (Report,  'nonexistent .') ; 

Acceptabla  i  Writaln  (Report,  'acoaptabla.'); 

Unacceptable  t  Writaln  (report#  'unacceptable'); 
end; 

ParCentSwaetSpota  t «  swaatspotaXlQQ  div  prof ila , totalsentancas; 
if  (PercantSwaetapots  >  50)  than  Analysis .Modulation  ' *  Good 
else  if  (PercantSwaetspots  >  20)  than  Analysia. Modulation  i3  Avarags 
else  Analysis. Modulation  »■  bad; 

Writaln  (Raport); 

Writeln  (Report,  'Number  of  medium  length  sentences  ',  swaetspotn 4); 
Write  (Report,  'Percent  of  medium  length  sentences  '); 

Writeln  (Report,  percentsweatspots t 4 ) ; 

Writeln  (Report); 


Writ#  (Report,  'Modulation  is  *); 
cast  Analysis. Modulation  of 

Good  i  Nritaln  (Report,  '  good.'); 

Avarsga  i  Writaln  (Raport,  'avaraga'); 

Bad  (  Writaln  (raport,  'bad'); 
and; 

and;  (X  procaduro  analyza  langth  of  santancas  *) 
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procedure  Graph_Word*PerSent  (Graph  i  WordaPerSanttype) ; 

var  i ,  j,  paganum  <  integer; 

bagin 

Paganum  »■  1; 

While  (Tally .totalsentence*  >  (Paganum  -  1)  x  70)  do  bagin 
Page  (Report); 

Writaln  (Raport, 'BREAKDOWN  OF  NUMBER  OF  WORDS  PER  SENTENCE' .60); 
Writaln  (Raport, 'Number  of  words'); 

for  i  i"  1  X  ((Paganum  -  1)  x  70)  to  70  X  (Paganum)  do 
GrapH(.i.)  i»  (GraphC.i.)  div  2); 
for  i  i3  50  downto  1  do  bagin 
Writa  <Raport,iX2i2,  '  *il); 

for  j  i 3  1  X  ((Paganum  -1)  X  70)  to  70  X  (Paganum)  do 
if  (GraphC.j.)  >*  i)  than 

Writa  (Raport,  *X'.l)  alsa  Writa  (Raport,  '  'll); 
Writaln  (Raport); 
and; 

Writa  (Raport,  •  '); 

for  i  i*  1  to  7  do 

Writa  (Raport,  i  X  10  +  ((Paganum  -  1)  x  70 ) i 1 0 ) ; 

Writaln  (Raport); 

Writaln  (Raport,  'Santanca  Numbar* i 45) ; 

Writaln  (Raport); 

Paganum  ia  Paganum  +  1; 
and; 
and; 


procadura  Calculata_Etymology  (profila  t  profilatype; 

var  analysis  i  analysistypa) ; 

bagin 

Writaln  (Raport);  Writalnf Raport) ;  Writaln(Raport); 

Writaln  (Raport,  'ETYMOLOGY  OF  WORDS' t45); 

Wri taln( Raport ) ;  Wri taln( Raport ) ; 

Writa  (Raport,  'Numbar  of  Latinata  words  i  '); 

Writaln  (Raport,  profila, NumLatinata); 

Writa  (Raport,  'Numbar  of  Garmanic  words  i  •); 

Writaln  (Raport,  profila.NumGarmanic); 

Writaln  (Raport); 

if  (Info.Ganra  3  NonFietion)  than  bagin 

if  (profila. NumLatinata  >  profila . NumGarmaniC  X  3)  than 
Analysis. Etymology  i»  TooBorrowad 
alse  if  (profila. NumLatinata  >  profila. NumGarmanic  x  2)  than 
Analysis . Etymology  t  =  Borrowad 
alsa  if  (profila. NumLatinata  >  profila. NumGarmanic)  than 
Analysis . Etymology  t=>  Mixed 

also  if  (profila. NumLatinata  >  profila. NumGarmanic  X  0.9)  than 
Analysis . Etymology  t»  Nativa 
alse  Analysis . Etymology  i3  TooNativa; 

and 

alsa  (X  ganra  3  Fiction  X)  bagin 

if  (profile . NumLatinata  >  profila.NumGarmanic  X  2)  then 
Analysis . Etymology  i»  TooBorrowed 
else  if  (profile. NumLatinata  >  profila . NumGarmanic  X  1,5)  then 
Analysis. Etymology  i*  Borrowad 
else  if  (profila. NumLatinata  >  profila.NumGarmanic)  then 
Analysis. Etymology  i*  Mixed 

also  if  (profile. NumLatinata  >  prof i le , NumGarmanic  X  0.5)  than 
Analysis. Etymology  i»  Native 
else  Analysis. Etymology  t3  TooNativa; 
end; 

Writa  (Report,  ' Etymology  is  '); 
case  Analysis. Etymology  of 

TooBorrowad  i  Writain  (Report,  'very  borrowed.'); 

Borrowed  t  Writain  (Raport,  'borrowed , ' ) ; 

Mixed  i  Writain  (raport,  'mixed'); 

Nativa  i  Writaln  (Raport,  'native.'); 

TooNativa  i  Writaln  (Raport,  'vary  native.'); 
end; 

and;  (X  procedure  calculate  etymology  x) 


procedure  Calculat#„Difficulty_of_Vocabulary  (profile  i  profilatypa; 

var  analysis  i  analysistype) ; 
var 

PerCentPostOrad, 

ParCentGrad, 

PerCantHigh  »  integer; 
bagin 

Writaln  (Raport);  Writeln(Raport)  ;  Writeln(Report) ; 

Writaln  (Report,  'DIFFICULTY  OF  VOCABULARY ' i 45) ; 

Wri tain (Report) ;  Wri tain ( Raport ) ; 

Writa  (Raport,  '  PostGraduata  difficulty  t  '); 

Writaln  (Report,  profile. NumPostGrad); 

Writa  (Raport,  '  Graduate  difficulty  i  '); 

Writaln  (Raport,  profile. NumGrad) ; 

Writa  (Raport,  '  High  School  difficulty  i  '); 

Writaln  (Raport,  profile. NumHigh  School); 

Writa  (Report,  '  Elementary  difficulty  i  '); 

Writaln  (Raport,  profile . NumElemantary); 

Wri taint  Report);  Wri teln( Report ) ; 

parcantPostgrad  i«  profile, numpoatgradXlOQ  div  profile , numfound; 
parcentOrad  i=  profile . numgradxl 00  div  profi la. numfound; 
parcanthigh  >■  profile.numhigh_schoolXlOO  div  profile. numfound; 
if  (parcantPostgrad  >  0)  than  analysis. HardWords  t«  varyhard 
alsa  if  (parcantgrad  >  5)  than  analysis , HardWords  t*  hard 
else  if  (parcanthigh  >  10)  than  analysis. HardWords  >*  challenging 
alsa  analysis. HardWords  i*  easy; 

Writa  (Raport,  'Percent  of  Postgraduate  difficulty  '); 

Writaln  (Raport,  percentpostgrad) ; 

Writaln  (Raport,  'Percent  of  Graduate  difficulty  ',  parcantgrad); 

Writaln  (Raport,  'Parcant  of  High  Sohool  difficulty  '  ,  parcanthigh); 
Wri tain (Report);  Wri taint Raport); 

Writa  (Raport,  'Difficulty  is  '); 
case  Analysis.HardWords  of 

Varyhardi  Writaln  (Report,  'vary  hard'); 
hardi  Writaln  (Report,  'hard'); 
challenging!  Writaln  (raport,  'challenging'); 
easy  i  Writaln  (Raport,  'easy.'); 
and; 

and;  (x  Calculata_Difficulty..of_,Vocabulary  x) 


procadura  Calculata_Tangibility  (profila  .  profilatypa; 
bgflin  var  analysis  .  analysiatypa); 

Writaln  (Rapori);  Writaln(Raport);  Writaln(Raport); 

Writaln  (Raport,  'TANGIBILITY*  .45);  P  1 

WritalnCRaport);  Writaln(Raport) ; 
y.J*.  g^Popt,  '  Numbar  of  Tangibla  words 

nr}J*ln  < R«por-t,  profila. NumTanoibla); 

Hr:rt*  55*pori'  '  Number  0f  fntangibla  words 

profila. Numlntangibla) ; 
if  (Info  Ganra  *  NonFiction)  than  bagin 

,f  LB.rf^;:?.us^j;fi,$Ir.>ssf?,11,'Nu”T,ngibl*  *  ,j th" 

'"■»«‘«'Nu»T.n»ibl.  «  Z)  th.n 
alsa  Analysis, Tangibility  .=  Solid; 

•nd 

C#  sanra  »  Fiction  X)  bagin 

Wo«U.Nu.T«wibL  «  l.S)  th.b 

alsa  Analysis. Tangibility  . ■  Solid; 

•no; 

Writaln  (Raport); 

Wnta  (Raport,  'Tangibility  ia  •); 
casa  Analysis. tangibility  of 

Solid  .  Writaln  (Raport,  •  vary  tangibibla. ') ; 

'  Britain  (Raport,  »  tangibibla,'); 

^^jSoft  i  Wntoln  (Raport,  '  vary  intangibibla. ' ); 

and;  (x  calculata  tangibility  X) 


procedure  Calculate.„Emotional_Tone  (profila  i  profdletypa; 

var  Analysis  i  analysistype); 
var 

PerCantSublima, 

ParCentPlsssant, 

Parcantunplaasant, 

ParCantHorrid, 

Emotionlndax  <  integer; 
bagin 

Writaln  (Raport);  Writeln(Raport);  Writaln(Report); 


Writaln  (Report, 
Writeln(Report)  ; 
Write  (Report,  ' 
Writaln  (Raport, 
Writ*  (Raport,  ' 
Writaln  (Report/ 
Writ#  (Raport>  ' 
Writaln  (Raport/ 
Writa  (Raport/  ' 
Writaln  (Raport/ 
Writa  (Raport/  • 
Writaln  (Raport/ 
Writaln(Raport) ; 

ParoantSublima  i 3 


'EMOTIONAL  CONNOTATIONS ' i 45) ; 
WritalnCRaport); 

Sublima  connotations  t  '); 

profila. NumSublima) ; 

Plaasant  connotations  »  'if 

profila, Numplaasant) ; 

Nautral  connotations  i  ' ) t 

profila, NumNautral ) ; 

Unplaasant  connotations  i  'if 

profila , Numunplaasant) ; 

Horrid  oonnotationa  t  ' ) ; 

profila.NumHorrid) ; 

Writaln( Raport);  Writaln (Raport); 
profila. nufflsublimaXlOO  div  profila.numfound; 


paroantPlaasant  i*  profile . numplaasantxiOO  div  profila.numfound; 
paroantUnplaasant  i»  profila.numunplaasantMlOO  div  profila. numfound; 
parcanthorrid  i*  profile. numhorridXlOO  div  profils. numfound; 
Emotionlndax  i *  (parcantsublima  x  5)  + 

(parcantplaasant  *  2)  + 

(parcantunplaasant  K  2)  ♦ 

(parcanthorrid  *  5); 

if  (Emotionlndax  >  20)  than  Analysis. amotionality  t*  rich 
alsa  if  (Emotionlndax  >  10)  than  Analysis. amotionality  t«  standard 
also  Analysis. amotionality  i*  poor; 

Writaln  (Raport/  'Parosnt  of  sublima  oonnotationa 
' Parcant  of  plaasant  oonnotations 
'Paroant  of  unplaasant  connotations 
paroantUnplaasant); 

•Parcant  of  horrid  connotations 


(Raport/ 
(Raport, 

( Report t 

(Raport, 

(Raport); 

(Raport, 

(Raport) ; 

(Raport, 


•  /ParoantSublima) ; 

paroantPlaasant) 

•); 


.parcanthorrid); 


'Indax  of  Emotionality 
•Emotionality  is  •); 


Emotionlndax); 


X  1.2)  than 


Writaln 
Writa 
Writaln 
Writaln 
Writaln 
Writaln 
W-italn 

Writa  .  .... 

casa  Analysis . Emotionality  of 
Richi  Writaln  (Raport,  'Rich'); 

Standard  t  Writaln  (Raport,  'average'); 

Poor.  Writaln  (raport,  'poor'); 
and; 

if  ( ( (parcantsublima  *  5)  -^parcantplaasant)  > 

( (porcanthorrid  «  5)  +  parcantunplaasant) 

Ana  lysis. Toms  »*  poaitiva 
alsa  if  (((parcantsublima  X  5)  -^parcantplaasant)  X  1,2  < 

( (parcanthorrid  X  5)  +  parcantunplaasant))  than 
Analysis . Tons  i *  nagativa 
alsa  Analysis. Ton*  bland; 

Writaln  (Raport); 

Writa  (Raport,  'Ti«na  is  '); 
casa  Analysis. Tons  of 

Positivai  Writaln  (Raport,  'Positiva'); 

Bland,  if  (analysis . amotionality*  rich)  than  bagin 

Write  (Raport,  'a  balance  of  strong  positiva  '); 
Writaln  (Report,  'and  strong  nagativa  amotions.'); 
and  alsa  Writaln  (Raport,  'bland.'); 
nagativai  Writaln  (report/  'nagativa'); 
and; 

and;  (x  procedura  calculata  amotional  tons  X) 


proeadura  Calculsta..Vigor_of_Worda  (profila  i  profilatypa; 

var  analysis  i  analysistypa) ; 
var 

ParCantViolant# 

ParoantEnargstio# 

ParcantCalm# 

Strangthlndax  t  intagar; 
bagin 

Writaln  (Raport);  Writaln(Raport) ;  Writaln(Raport); 

Writaln  (Raport,  'VIGOR  OF  WORDS' i45); 

Writaln(Raport) ;  Writaln(Raport) ; 

Writs  (Raport#  '  Wopds  of  Extrama  Vigop  i  '); 

Writaln  (Report#  profila.HumViolant);  Writaln; 

Writa(Report#  '  Wopds  of  Much  Vigor  i  '); 

Writaln  (Report#  profila.NumEnargatic); 

Writs  (Raport#  '  Word*  of  Soma  Vigor  i  '); 

Writaln  (Raport#  profila . NumCalm) ; 

Writs  (Report#  •  Wards  of  Littla  Vigor  >  ' 

Writaln  (Report#  profila . numinart) ; 

Writaln(Raport) ;  Writaln(Raport) ;  WritalnCRaport) ; 
parcantViolant  i«  profila, numviolantXlOO  div  profila, numfound; 
parcantEnargatic  i«  profila, numanargaticXIOO  div  profila. numfound; 
parcantCalm  i"  profile. numcalmXlOO  div  profila.numfound; 
Strangthlndax  i *  parcantviolant  x  10  + 

parcantanargatic  x  5  + 
percentcalm; 

if  (Strengthlndex>  60)  than  Analysis . Strangth  t*  VaryStrong 
alsa  if  (Strangthlndax  >  50)  than  Analysis, Strangth  i*  strong 
alsa  if  (Strangthlhdax  >  20)  than  Analysis. Strangth  t ■  livaly 
alsa  Analysis. Strangth  ia  weak; 

Writs  (Raport#  'Paroant  of  words  of  axtrama  vigor  i  '); 

Writaln  (Raport#  parcantviolant) I 

Writs  (Raport#  'Parcant  of  words  of  much  vigor  i  '); 

Writaln  (Raport#  parcantEnargatic) ; 

Writs  (Raport#  'Parcant  of  words  of  soma  vigor  i  '); 

Writaln  (Raport#  parcantCalm) ; 

Writaln  (Raport); 

Writaln  (Raport#  'Indax  of  Vigor  '#  Strangthlndax); 
Writaln(Raport) ; 

Writs  (Raport#  'Vigor  is  '); 
casa  Analysis, Strangth  of 

VaryStrongi  Writaln  (Raport#  'vary  strong' ) ; 
strongi  Writaln  (Raport,  'strong'); 
livalyi  Writaln  (raport#  'livaly'); 
waak  i  Writaln  (Raport#  'waak.'); 
and; 

and;  (X  prooadura  caloulata  vigor  of  words  x) 


procedure  Write_NumRecom  (Var  Numracom  i  intagar); 
bagin 

NumRacom  i«  NumRacom  +  1; 

Writaln (Raport);  Wri taln( Raport ) ; 

Writaln  (Raport,  'RECOMMENDATION  NUMBER  '«55#  NumRacomt2); 
Writaln( Raport);  Wri taln( Raport ) ; 
and; 


procedure  MeKe^Recommendetione  (analysis  ■  analysistype); 

var  NumReeom  >  integer;  (X  |  of  recommendations  made  so  far  ft) 

begin 

Writeln  (Report);  Writeln  (Report); 

NumReeom  <■  0; 
with  analysis  do  begin 

if  (Runons  *  Unacceptable)  then  begin 
Write_NumRecom  (NumReeom); 

Writeln  (Report,  'You  tend  to  write  run-on  sentences.'); 

Writeln  (Report,  'Check  your  longest  sentences  for  run  ons.'); 
Writeln  (Report,  'Break  them  up  into  units  of  single  ideas.'); 

end; 

if  (Sentlangth  3  TooLong)  end  (HardWords  >  Easy)  then  begin 
Write.NumRecom  (NumReeom); 

Write  (Report,  'Your  average  sentences  are  too  long  '); 

Writeln  (Report,  'for  the  difficulty  of  your  vocabulary.'); 
Writeln  (Report,  'Use  simpler  words  or  shorter  sentences,') 

and; 

if  (Modulation  ■  Bad)  then  begin 
Writa.NumReeom  (NumReeom); 

Write  (Report,  'Your  aantencaa  tend  to  be  too  short  or  too'); 
Writeln  (Report,  'long.  Try  to  moderate  and  modulate  '); 

Writeln  (Report,  'the  length  of  your  aantencaa. ' ); 

end; 

if  (Etymology  ■  TooBorrowed)  then  begin 
Write  NumReeom  (NumReeom); 

Writeln  (Report,  'Use  shorter,  more  native  English  words.'); 

end; 

if  (Emotionality  3  Poor)  then  begin 
Write  NumReeom  (NumReeom); 

Writeln  (Report,  'Use  more  words  that  provoke  emotions.'); 

and; 

if  (Info.Oenre  ■  Fiction)  then  begin 
if  (Strength  <  Strong)  then  begin 
Write  NumReeom  (NumReeom); 

Writeln  (Report,  'Use  more  evocative,  sensory  words.'); 
end; 

if  (Etymology  <  Mixed)  and  (Hardwords  >  Challenging)  and 
(SentLength  >  Medium)  than  begin 
Write  NumReeom  (NumReeom); 

Writeln  (Report,  'Your  fiction  reeda  like  non-fiction.'); 
Writeln  (Report,  'Unless  you  are  tergatting  a  highly  '); 
Writeln  (Report,  'literate  eudienoe,  keep  it  simple.'); 

if  (Tangibility  3  Soft)  then  begin 
Write_NumReoom  (NumReeom); 

Write  (Report,  'A  narrative  should  be  oonorata  and  '); 
Writeln  (Report,  'detailed.  Describe  things,  not  ideas.  '); 
and; 

end  else  begin  (X  genre  is  nonfiction  x) 

if  (Etymology  «  TooNative)  and  (Strength  3  VeryStrong)  then  begin 
Write  NumReeom  (NumReeom); 

Writeln  (Report,  'Your  non-fiction  reads  like  fiction.'); 

Write  (Report,  'Ask  yourself  if  it  is  too  vigorous  '); 
Writeln  (Report,  '  for  the  audience  that  will  reed  it.'); 
end; 

if  (Strength  •  Weak)  then  begin 
Write  NumReeom  (NumReeom); 

Writeln  (Report,  'Use  more  evocative,  sensory  words.'); 
and; 
end; 
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if  (Strangth  >  Livaly)  and  (Emotionality  >  Standard)  and 
(Modulation  ■  Good)  than  bapin 

Writa  (Raport,  'Congratulations!  You  writa  with  •); 

Writaln  (Raport,  'strangth  and  graca,  '); 
and; 

if  (Runona  *  nonaxiatant)  than  bagin 

Writa  (Raport,  'Congratulationa!  You  navar  aaam 
Writaln  (Raport,  'to  writa  run-on  aantancaa. ' ) ; 

and; 

If  (NumRacom  ■  0)  than  bagin 
Writaln  (Raport); 

Writa  (Raport,  'This  ia  a  solid  piaoa  of  writing 

Writaln  (Raport,  'wall  within  tha  tradition*  of  its  ganra.'); 

Writaln  (Raport); 

Writa  (Raport,  'You  ara  as  abla  to  undarstand  tha  moaning  of'); 
Writaln  (Raport,  '  tha  abova  ehamctariatics  a*  Tha  Stylist.'); 
Writaln  (Raport,  'Stand  tha  coursa!'); 

and; 

and;  (X  with  analysis  do  X) 

and;  (X  procadura  maka  racommandationa  X) 


Sroeadura  $tora_Naw_Dix) 

•gin 

Rewrite  (Dixnnary,  'Dixonary  text  a')) 

Paga)  Writain)  Writain)  Writain;  Writain) 

Writ#  ('Stylist  is  now  storing  tha  naw  dictionary. ' i55)) 
StoraJMT  (Root)) 
and; 


procadura  Stora_BST  (p  i  antrypointar) ) 
bagin 

if  (p  <>  nil)  than  bagin 

if  (p-. status  *  storabla)  than  Stora  (p-)j 
Stora^BST  (p-.laft)j 
Stora_BST  (p-. right)) 
and) 
and) 


procadura  Stora  (Entry  i  antrytypa)) 
var 

aourcaltr, 
diffi cultyltr, 

concratanassltr,  omotionltr, 
vigorltr  i  char) 
bagin 

with  antry  do  bagin 

if  (sourca  *  Latinata) 


caaa  difficulty  of 


than 

•Isa 


sourcaltr 

aourcaltr 


i  a 

1  a 


4' 

'g') 


Postgrad  > 

Grad  ■ 

High_School  i 
Elamantary  t 
and) 

if  ( concratanaas 


of 


difficultyltr 
diffioultyltr  i» 
difficultyltr  i* 
difficultyltr 


'P') 

»g') 

•h') 

'•') 


a  Tangibla) 


casa  amotion 
Sublima 

Pleasant 

Nautral 

Unplaasant 

Horrid 

and; 

casa  vigor  of 
Violent 
Energetic 
Calm 
Inert 
and) 

and)  (X  with  antry 
Writain  (Dixonary, 


than 

•  laa 


concratanaaaltr 

concratanassltr 


» " 
I  a 


»t 

•i 


•motionltr 

•wotionltr 

•motionltr 

•motionltr 

•motionltr 


i  ■ 

I  a 
i  a 

l  * 
I  a 


•a* 


•n') 
'  u ' ) 
*h») 


vigorltr 

vigorltr 

vigorltr 

vigorltr 


I  a 

l  ■ 

i  a 

i  ■ 


'V') 
'•') 
'  c ' ) 
'i') 


do  x) 

Entry. word,  aourcaltr,  difficultyltr, 
conoratanasaltr,  amotionltr,  vigorltr)) 


and) 


APPENDIX  D 

THE  CODE  OF  DIXSPLIT  AND  DIXJOIN 


<*  NOTE  i  THE  FOLLOWING  IS  NOT  IN  EXECUTABLE  ORDER. 

IT  IS  IN  THE  ORDER  OF  GREATEST  CONCEPTUAL  CLARITY.  >0 

(*♦560000*0 

program  Dixsplit  (input*  output); 
const 

maxsent  *  1 0 0  0 ; 
wordlangth  3  15; 
linolength  ■  60; 
phrasalength  «  A; 
freunum  ■  50; 
aalnum  *  200; 
typa 

Ganratypa  ■  (Nonfiction*  Fiction); 

Freouancytypa  *  (Frequently#  Saldomly#  Whan_Dona); 

Balancatypa  ■  (Plus*  Zero,  Minus); 

Durationtypa  *  (Parmanant*  Temporary); 

Sourcatypa  ■  (Latinate,  Germanic); 

Difficultytype  *  (PostOrad*  Grad*  High  School#  Elamantary); 
Concretanasstypa  »  (Tangible*  Intangible); 

Emoti ontype  *  (Sublime*  Pleasant*  Nuatral*  Unpleasant*  Horrid); 
Vigortypa  *  (Violent*  Enargatic*  Calm*  Inert); 

Wordtypa  *  paoked  array  ( .1 . .wordlangth. )  of  char; 

Linotype  *  packed  array  ( .1 . . linelangth. )  of  char; 

Phrasotype  ■  packed  array  < . 1 . . phraaeisngth. )  of  wordtypa; 
EntryPointor  B  -'Entrytypa; 

ConoordPointar  a  •'Concordtypej 
Entrytypa  «  record 
Word  t  Wordtypa; 

Balance  i  Balancatypa; 

Duration  i  Durationtypa; 

Source  t  Sourcatypa; 

Difficulty  i  Difficultytype; 

Concrotanaas  t  Concratanasstypa; 

Emotion  t  Emotiontypa; 

Vigor  i  Vigortypa; 

Left*  Right  i  EntryPointor; 

Down  ■  Concordpoi nt#r; 
and; 

Concordtype  3  record 
Up  i  EntryPointor; 

Down, 

Next,  Last  i  ConcordPointer; 
and; 


91 


var 

FourRoot, 

Root  i  EntryPointar; 
Lastward  i  ConcordPointar; 
Latinat, 

Garmanc, 

Tangbla, 

Intngbla, 

Poatgrd, 

Graduato> 

HighSchl , 

Elamanta. 

Sublma, 

Plaasnt, 

Nuatrl , 

Unplaasn, 

Horrd, 

Violnt, 

Enargati/ 

Clm» 

Inrt> 

Dixonary  i  taxt; 


procadura  Attach  (Entry  t  antrytypa;  var  p 
basin 
Naw  (p); 

with  do  bagin 

word  i*  antry.word; 
duration  i«  parmanant; 
aourca  i ■  antry. aourca; 
difficulty  t«  antry. difficulty; 
concratanaaa  t«  antry . concratanaaa; 
amotion  i*  antry, amotion; 
vigor  i«  antry. vigor; 
laft  i«  nil;  right  ia  nil; 
down  i»  nil; 
and; 
and; 


i  antrypointar); 


u 
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procadura  Encoda  C i •  antrytypa;  var  xaourca  i  char; 
var  xdifficulty  i  char; 

var  xconcratanaaa  i  char;  var  xamotion  >  char 
var  xvigor  •  char); 

begin 

if  (its.aourca  ■  Latinata)  than  xaourca  i»  1 

alaa  xaourca  i=  'o'; 

caaa  ita. difficulty  of 

Poatorad  i  xdiffioulty  ««  'p'; 

Grad  t  xdifficulty  i»  'g'; 

High_School  i  xdifficulty  ia  'h'; 

Elamantary  i  xdifficulty  i"  'a'; 

and; 

if  ( i ts . concratanass  =  Tangible)  than  Xconcratanaas  i“  *t' 

alsa  Xconcratanaas  13  ' i ' ; 


caaa  its. amotion  of 

Subiima  i  xamotion 

Plaaaant  «  xamotion 

Nuatral  i  xamotion 

Unpleasant  t  xamotion 

Horrid  ■  xamotion 

and; 

caaa  ita. vigor  of 

Violent  i  xvigor  t* 

Energetic  t  xvigor  t * 

Calm  i  xvigor  ■■ 

Inart  i  xvigor  i* 

and; 
and; 


t  3  's' ; 
i*  'p'; 
i  ■  'n* ; 
i  ■  •  u  • ; 
t»  •  h • ; 


•v»; 
•a'; 
•o'; 
•i '; 


prooadura  Decode  (var  ita  <  antrytyga; 

xaourca  i  char;  xdifficulty  i  char; 
xooncratanaaa  i  char;  xamotion  t  char; 
xvigor  i  char); 

begin 

if  (Xaourca  3  *1*)  than  its.aourca  i *  Latinata 
alsa  its.aourca  i*  Germanic; 


caaa  Xdiffioulty  of 

'p'  t  ita. difficulty  i« 

•g'  . . 

•h* 


i  ita. difficulty 
i  its. difficulty 


Postgrad; 
Grad; 


.i“  High_Sohool; 
•a'  i  ita. difficulty  i»  Elementary; 


and; 

if  (XoonoratanasaB 


•t')  than  ita.ooncratanasa  i*  Tangible 
alaa  ita. conoratanaaa  >■  Intangible; 


case  Xamotion  of 

•s'  t  ita. amotion  t«  Subiima; 

•p*  t  ita. amotion  i*  Plaaaant; 

* n 1  •  ita. amotion  i*  Nuatral; 

•u'  i  its. amotion  i*  Unplaasant; 
•h'  i  ita. amotion  ■«  Horrid; 
and; 

case  Xvigor  of 

•v'  i  ita. vigor  i*  Violent; 

•a'  i  ita. vigor  i*  Energetic; 

'o'  i  its. vigor  i»  Calm; 

•i'  i  ita.  vigor  < ■  Inart; 
and; 
and; 
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procedure  AVL_Insert  (Entry  i  entrytype;  ver  p  i  Entrypointar; 

ver  belenced  i  boolean); 
ver 

pl*  p2  <  entrypointer; 
begin 

if  (p  ■  nil)  then  begin 
Attach  (Entry*  p); 

P-. balance  »■  Zero; 
balanced  »■  true; 

else  if  (Entry. word  «  p-.word)  than  Writeln  ('INSERT  COLLISION') 
alaa  if  (Entry. word  <  p-,word)  than  begin 
AVL  Insert  (Entry*  p-.laft*  balanced); 

If  Balanced  than  (X  left  pointer  has  grown  higher  x) 
case  p*  .balance  of 

Plus  i  begin  p-. balance  t *  Zero;  balanced  t  =  false;  ond; 

Zero  t  p-. balance  t=  Minus; 

Minus  i  begin  (x  rebalance  X) 
pi  t«  p-.left; 

if  (pi-. balance  »  Minus)  then  begin(K  single  LL  rotatX) 
P-.left  i»  pi-. right; 
pl-. right  t«  pj 
P-. balance  i«  Zero; 

P  i"  pi; 
and  (X  if  X) 

else  begin  (X  double  LR  rotation  x) 
p2  »■  pl-. right; 
pl-. right  i*  p2-.left; 
p2-.left  «■  pl; 

P-.left  i»  p2-. right; 
p2-. right  »■  p; 

if  (p2- .balance  ■  Minus)  then  p-. balance  i*  Plus 
else  p-. balance  i»  Zero; 

if  (p2-. balance  ■  Plus)  than  pl-. balance  i ■  Minus 
else  pl-. balance  t*  Zero; 

P  t"  p2; 
end;  (X  else  X) 

P-. balance  i*  Zero;  balanoed  «■  falaa; 
end;  (x  case  of  Minus  X) 
end;  (X  of  aaaes  X) 
end  (x  if  Entry. word  <  p-.word  X) 


•ls«  if  C  Entry. word  >  p-.word)  than  bagin 

AVL  Insort  (Entry,  p-.. right, ,  balanced); 
if  balanced  than  (X  right  pointer  has  grown  highar  X) 
oasa  p-.balanca  of 

Minus  i  bagin  p-. balanca  i«  Zaro; 

balancad  i »  falsa;  and; 

Zaro  i  p-<.balanca  i«  Plus; 

Plus  i  bagin  (X  rabalanoa  X) 
pi  i*  p-. right; 

if  (Pl'.baianca  «  Plus)  than  bagin 
(X  singla  RR  x) 

P7* right  <■  pl-.laft; 

Pi". left  !»  p; 

P-'.baianca  i*  Zaro; 

,  p  1 =  pi; 
and  (X  if  M) 

also  bagin  Cx  doubla  RL  rotation  X) 

P2  i«  pl-.iaft; 

pZ-*. right; 
p2->. right  i  •  pi; 

P-. right  i*  p2- .laft; 
p2-.laft  p; 

if  ( p2->.  balance  ■  Plus)  than 
P-. balance  t*  Minus 
also  p-, balanca  i«  Zaro; 
if  (p2-. balanca  ■  Minus)  than 
pi". balanca  i>  Plus 
alsa  pi->. balanca  i*  Zaro; 

.  P  '*  p2; 

end;  (X  doubla  RL  rotation  X) 
p-'.baianca  i»  Zaro;  balanoad  t ■  falsa; 
and;  (X  c aae  of  balanca  »  Plus  X) 
tu  .  .  .  -  ,  <#  of  oases  X) 

end  (X  of  if  Entry. word  >  p-.word  X) 
also  balanced  t»  fal.ita; 
and;  (X  of  procedura  AVL^lnaart  x) 


»  V-.M>  .<  »\  £«*•«.  1-*  '  V  'll/ 


procedure  Init_Resaaroher; 
var 
i» 

stepcount. 
threshhold. 
dixlength  *  integer; 
a*  d,  c*  a*  v  i  char; 
entry  i  antrytypa; 
balanced  i  boolean; 
begin 

Reset  (Dixonary/  *  Dixonary  text  a'); 

New  (Root); 

Readln  (Dixonary.  Root-. word.  s.  d.  c,  a.  v); 

Decode  (Root-,  s.  d>  c.  e.  v)j 
Root-. left  i=  nil;  Root-. right  i=  nil; 

Root-. down  i »  nil; 

Root- . balance  <  =  Zero; 

Root- . duration  i«  Permanent; 

Lastward  <*  nil; 

Stepcount  ia  0; 

while  not  EOF  (Dixonary)  do  begin 

Readln  (Dixonary.  Entry. word.  a.  d.  c,  a.  v); 

Decode  (Entry,  a.  d.  c.  e.  v); 

Entry. balance  13  Zero;  balanced  is  falaa; 

AVL_Insart  (Entry.  Root,  balanced); 
stepcount  1*  stepcount  +  1; 
if  (stepcount  mod  200  ■  0)  then  begin 

Page;  Writeln;  Writeln;  Writeln;  Writeln; 

Writeln  ("Stylist  is  now  loading  its  dictionary.'); 
Britain  ("Last  word  loaded  was  entry .word  1 16 ) ;  Writeln 
Writeln  ("Stylist  has  loaded  ' .stepcount t5. '  entries."); 
Writeln;  Writeln; 
end;  (x  if  x) 
end;  (X  while  not  EOF  X) 
end; 


procedure  Inorder  (var  F  *t  t  antrypointer)  ; 
begin 

if  (Root  <>  nil)  then  begin 
Inorder  (Root-*. left); 
case  Root** .  Source  of 

Latinete  i  Writeln  (Latmat, 

Oermenic  i  Writeln  (Germane, 

end; 

case  Root-1.  Concreteness  of 

Tangible  ■  Writeln  (Tangble, 


Root-*  .word, 
Root- .word, 


Root-. word, 


'1*  .1); 
V  .1); 


't*  .1); 

n  i 


intangible  i  Writeln  (InTngble,  Root-. word,  *i'il); 

and; 

oase  Root- . difficulty  of 

Postgrad  i  Writeln  (Postgrd,  Root-. word,  * p • 1 1 ^ ; 

Grad  i  Writoln  (Graduate,  Root-. word,  * o *  si); 
High_School  t  Writeln  (HighSchl,  Root-. word,  'h'll); 
Elementary  i  Writeln  (Elementa,  Root-. word,  'e'tl); 

end; 

case  Root-. emotion  of 

Sublime  t  Writeln  (Sublme,  Root-. word,  's'tl); 
Pleasant  i  Writeln  (Pieasnt,  Root-. word,  'p'iD; 
Nuetral  i  Writeln  (Nuetrl,  Root-. word,  * n • « 1 ) ; 
Unpleasant  i  Writeln  (Unpleasn,  Root-. word,  ’u'll); 
Horrid  ;  Writeln  (Horrd,  Root-. word,  * h *  *  1 ) ; 

and; 

case  Root-. vigor  of 

Violent  i  Writeln  (Violnt,  Root-. word,  'v’tl); 
Energetic  t  Writeln  (Energeti,  Root-. word,  ‘e’); 
Calm  i  Writeln  (Clm,  Root-. word,  'c'); 
inert  i  Writeln  (Inrt,  Root-. word,  ’ i • ) ; 

end; 

Inorder  (Root- . right) ; 


end; 

end; 


97 


begin  (*  mein  program  *) 

Init_Re*aarcher; 

Rewrite  (Letinat,  'Latinat  text  a*); 
Rewrite  (Germane,  'Germane  text  a'); 
Rewrite  (Tangbl*,  'Tenable  text  a'); 
Rewrite  (Xntngble,  'Intngbla  text  a'). 

Rewrite  (Poatgrd,  'Poatgrd  text  a'); 
Rewrite  (Graduate,  'Graduate  text  a'); 
Rewrite  (KigHSohl,  'HighSchl  text  a * ); 
Rewrite  (Element*,  'Element*  text  o'); 
Rewrite  (Sublme,  'Sublme  text  a'); 
Rewrite  (Pleasnt,  'Pleasnt  text  a'); 
Rewrite  (Nuetrl,  'Nuetrl  text  a'); 
Rewrite  (Unpleasn,  'Unpleesn  text  a'); 
Rewrite  (Horrd,  'Horrd  text  a'); 
Rewrite  (Viol  .t,  'Violnt  text  a'),‘ 
Rewrite  (t'nergeti,  ‘Energati  text  a*); 
Rewrite  (Clm,  'Clm  text  a')i 
Rewrite  (Inrt,  'Inrt  text  a'); 

Inorder  (Root); 
end . 


I) 

I 


Of*S60000X) 

program  Dixjoin  (input,  output); 
const 

maxsent  =  1000; 
wordlength  *  15; 
linalength  *  60 ; 
phrasalangth  3  4; 
freqnum  3  50; 
selnum  3  200; 

type 

Genratypa  =  (Nonfiction,  Fiction); 

Fraquencytypa  3  (Fraquantly,  Saldomly,  Whan_Dona); 

Balancatyps  3  (Plus,  Zaro,  Minus); 

Durationtypa  3  (Parmananf,  Temporary); 

Sourcetypa  3  (Latinate,  Germanic); 

Diff i cultytype  =  (PostGrad,  Grad,  High  School,  Elamantary); 
Concratenasstypa  3  (Tangibls,  IntangibTa); 

Emottontype  3  (Sublima,  Plaasant,  Nuatral,  Unplaasant,  Horrid); 
Vigortype  3  (Violsnt,  Enargatic,  Calm,  Inart); 


Wordtypa  3  packad  array  ( .1 . .wordlangth. )  of  char; 

Linatypa  3  packad  array  ( . 1 . . linalangth . )  of  char; 
Phrasatypa  3  packad  array  (. 1 .. phrasalangth. )  of  wordtypa; 

EntryPointar  “  -Entrytypa; 

ConcordPointar  3  -Concordtypa; 

Entrytypa  3  racord 
Word  i  Wordtypa; 

Balance  >  Balancatypa; 

Duration  i  Durationtypa; 

Source  :  Sourcatypa; 

Difficulty  t  Difficultytype; 

Concratanass  i  Concratanasstypa; 

Emotion  i  Emotiontypa; 

Vigor  i  Vigortype; 

Laft,  Right  t  EntryPointar; 

Down  i  Concordpointer; 
and; 

Concordtypa  3  record 
Up  i  EntryPointar; 

Down, 

Naxt,  Last  t  ConcordPointar; 
end; 
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var 

FourRoot, 

Root  t  EntryPointar; 

Lastward  i  ConcordPointar; 

Latinat, 

Germane, 

Tangble, 

Intngbla, 

Postgrd, 

Graduate, 

HighSchl , 

Elemanta, 

Sublma, 

Pleasnt, 

Nuatrl, 

Unplaasn, 

Horrd, 

V iolnt, 

Enargeti, 

Cim, 

Inrt, 

Dixonary  i  text; 
word  i  wordtype; 
p  i  antrypointar; 
c  i  char; 

bagindUr*  AU*ch  (Entry  '  antrytype;  var  p  .  antrypointar); 
New  <p); 

with  p-  do  bagin 

word  i ■  ontry. word; 
duration  «=  permanent; 
sourca  i*  entry .source; 
difficulty  i=  an try .difficulty; 
concreteness  i»  entry . concreteness; 
amotion  t=  antry, amotions 
vigor  i=  antry. vigor; 
laft  i*  nil;  right  is  nil; 
down  I*  nil; 
end; 
end; 


procedure  Encode  (its  >  entrytypa;  var  xsourca  t  char; 
var  xdifficulty  i  char; 

var  xconcratanass  i  char;  var  xamotion  ■  char; 
var  xvigor  i  char); 

begin 

if  (ita.sourca  ■  Latinata)  than  xsourca  iB  '1* 

alaa  xsourca  <  =  ' g'; 

casa  its . difficulty  of 

Postgrad  t  xdifficulty  t=  'p'; 

Grad  i  xdifficulty  t*  'g'; 

High_School  i  xdifficulty  t=  'h'; 

Elamantary  i  xdifficulty  >  =  ’a*; 
and; 

if  (its. concretanass  =  Tangible)  than  Xconcratanass  i*  '  t' 

also  Xconcratanass  t  =  * i • ; 

casa  its. emotion  of 

Sublima  t  xamotion  ia  ’s'; 

Pleasant  i  xamotion  is  'p'; 

Nuatral  i  xamotion  ia  ' n * ; 

Unpleasant  i  xamotion  »8  * u * ; 

Horrid  i  xamotion  i=  'h'; 

and; 

casa  its. vigor  of 

Violent  i  xvigor  i»  'v'; 

Energetic  i  xvigor  is  'a'; 

Calm  t  xvigor  i*  *c'; 

Inert  i  xvigor  t*  • i 1 ; 

and; 
and; 


procedure  Decode  (var  its  i  antrytype; 

xsourca  i  char;  xdifficulty  i  char; 
xconcratanass  i  char;  xamotion  >  char; 
xvigor  t  char); 

begin 

if  (Xsourca  a  *1')  than  its, source  Ist  Latinata 

else  its. source  i  =  Germanic; 

casa  Xdifficulty  of 

'p'  i  its. difficulty  < *  Postgrad; 

' g *  i  its . difficulty  i®  Grad; 

* h *  i  its , difficulty  t*  High_School ; 

'a*  i  its. difficulty  is  Elamantsry; 

and; 

if  (Xconcratanass111  't')  than  its . concretanass  *®  Tangible 

else  its. concreteness  i=  Intangible; 

case  Xamotion  of 

's'  i  its. amotion  i=  Sublima; 

'p'  i  its. amotion  ia  Pleasant; 

'n'  i  its. amotion  ia  Nuatral; 

•u'  i  its. amotion  ia  Unpleasant; 

'h'  i  its. amotion  i -  Horrid; 

and; 

case  Xvigor  of 

'v'  i  its. vigor  i»  Violent; 

'a*  i  its. vigor  **  Energetic; 

'c'  i  its. vigor  >=  Calm; 

1 i •  i  its.  vigor  i*  Inert; 

end; 


I 


procedure  AVL_Insart  (Entry 


entrytype;  var  o  i  Entrypointer; 
var  balanced  i  boolean); 


pi,  p2  i  entrypointer; 
begin 

if  (p  3  nil)  then  begin 
Attach  (Entry,  p); 

C-.  balance  «■  Zero; 
alanced  i*  true; 

end 

elsa  if  (Entry.word  3  p-,word)  then  Writeln  (•INSERT  COLLISION') 
else  if  (Entry.word  <  p-.word)  then  begin 
AVL  Insert  (Entry,  p-.left,  balanced); 

If  Balanced  than  (x  left  pointer  has  grown  higher  X) 
case  p-*. balance  of 

Plus  i  begin  p-. balance  t*  Zero;  balanced  i“  false;  end; 

Zero  i  p-. balance  i=  Minus; 

Minus  i  begin  (X  rebalance  X) 
pi  i»  p-.left; 

if  (pi-. balance  3  Minus)  then  bagin(X  single  LL  rotatx) 
P-.left  i»  pi-. right; 
pl-. right  «  *  p; 

P-. balance  i*  Zero; 

P  i3  pl; 
end  (X  if  X) 

else  begin  (X  double  LR  rotation  X) 
p2  i 3  pl-. right; 
pl-. right  t«  p2-.left; 
p2-,left  t»  pl; 
p-.left  <3  p2-. right; 
p2- , right  t  3  p; 

if  (p2-. balance  »  Minus)  than  p-. balance  i3  Plus 
else  p-. balance  i*  Zero; 

if  (p2-. balance  »  Plus)  then  pl-.balence  ia  Minus 
else  pl-. balance  »3  Zero; 

P  ' 3  p2; 
end;  (X  elsa  X) 

P-. balance  i3  Zero;  balanced  i3  false; 
and;  (X  case  of  Minus  X) 
end;  (X  of  oases  X) 
end  (x  if  Entry.word  <  p-.word  x) 


Mia ' 


•Is*  if  (  Entry. word  >  p-.word)  then  begin 

AVL  Insert  < Entry*  p-. right,  balancad); 
if  balanced  than  ( x  right  pointer  has  grown  higher  JO 
case  p-. balance  of 

Minus  i  begin  p-. balance  »■  Zero; 

balanced  t*  false;  end; 

Zero  (  p-. balance  i=  Plus; 

Plus  i  begin  (*  rebalance  X) 
dJ  ia  p-. right; 

if  <pl- . balance  s  Plus)  then  begin 
CX  single  RP  *) 

P-. right  >e  pi-. left; 
pi-. left  i=  p; 

P-, balance  >-  Zero; 
p  i=  pi; 
and  (X  if  X) 

else  begin  (X  double  RL  rotation  X) 
p2  Is  pi-. left; 
pi-. left  i«  p2-. right; 
p2-. right  i°  pi; 

P-. right  »«  p2-.left; 
p2-.left  i®  p; 

it  <p2-. balance  »  Plus)  then 
P-. balance  i*  Minus 
else  p-. balance  <  «  Zero; 
if  Cp2-. balance  *  Minus)  then 
pi-. balance  »«  Plus 
else  pl-.baience  »*  Zero; 

P  ia  p2; 

end;  (X  double  RL  rotation  X) 

P-. balance  t»  Zero;  balanced  i»  false; 
end;  (X  case  of  balance  ■  Plus  X) 
and;  (X  of  cases  X) 
end  (X  of  if  Entry. word  >  p-.word  X) 
else  balanced  i*  falsa; 
end;  (x  of  procedure  AVL_lnsert  X) 


procedure  Init_Rasearchar; 
var 
i, 

stapcount, 
thrashhold, 
dixlangth  <  intagar; 
a,  d,  o,  a,  v  i  char; 
entry  i  entrytype; 
balancad  >  boolean; 
basin 

Reset  (Dixonary,  'Dixonary  text  a'); 

New  (Root); 

Roadln  (Dixonary,  Root-. word,  a,  d,  c,  a,  v); 

Decods  (Root-,  s,  d,  c,  a,  v); 

Root-. left  i*  nil;  Root-. right  s a  nil; 

Root-. down  ia  nil; 

Root- . balance  t*  Zero; 

Root-. duration  i»  Permanent; 

Lastward  ts  nil; 

Stapcount  t *  0; 

while  not  EOF  (Dixonary)  do  begin 

Raadln  (Dixonary,  Entry. word,  a,  d,  e,  a,  v); 

Decode  (Entry,  a,  d,  c,  a,  v); 

Entry. balance  i s  Zero;  balanoed  i*  falsa; 

AVL_In*ert  (Entry,  Root,  balanced); 
stepcount  n  stapcount  +  1; 
if  (stapcount  mod  200  s  0)  tt  «n  begin 

Page;  Writeln;  Writeln;  Writeln;  Writeln; 

Writeln  ('Stylist  is  now  loading  its  dictionary. ' ) ; 

Writeln  ('Last  Word  loaded  was  ',  entry, wordilfi);  Writeln; 
Writeln  ('Stylist  has  loaded  1 , stepcount t 5, 1  entries.'); 
Writeln;  Writeln; 
and;  (x  if  *) 
and;  (K  while  not  EOF  X) 
end; 

procedure  Inorder  (var  Root  <  antrypointer; 

word  i  wordtype; 
var  P  t  antrypointer); 

begin 

if  (Root-. word  *  word)  then  p  <a  root 

else  if  (root-. word  >  word)  then  Inorder  (Root-. left,  word,  p) 
else  Inorder  (Root- . right,  word,  p); 
end; 


procedure  Store_BST  (p  i  antrypointer); 

var  s,  d,  c,  e,  v  i  char; 

bagin 

if  (p  <>  nil)  then  begin 

Encode  (p-,  s,  d,  c,  e,  v); 

Writeln  (Dixonary,  p-.word,  s,  d,  c,  e,  v); 
Store_BST  (p-.left); 

Store_BST  (p-. right); 
end; 
end; 

procedure  Store.. New_Dix; 
begin 

Rewrite  (Dixonery,  'Dixonary  text  a'); 
Store.BST  (Root); 
and; 


V  kVv'nWV/v  '.N '.\.NVVV-Wv\y\‘4V 


basin  (X  main  program  X) 

Init_Rasaarchar; 

Reset  (Latinat,  ' Latinat  taxt  a'); 

While  not  EOF  (Latinat)  do  begin 
Readln  (Latinat,  word,  c); 

if  (  c  <>  '1')  than  bagin  Inordtr  (root,  word,  p); 
p--. source  i =  germanic;  and; 
and; 

Raset  (Germane,  'Garmano  taxt  a'); 

While  not  EOF  (Germane)  do  bagin 
Raadln  (Garmanc,  word,  c); 

if  (  c  <>  'g')  than  basin  Inordar  (root,  word,  p); 
P-, source  i*  latinata;  and; 
and; 

Resat  (Tangbla,  'Tangbla  taxt  a'); 

Whila  not  EOF  (Tangbla)  do  begin 
Raadln  (Tangbla,  word,  c); 

if  (  c  <>  't')  than  bagin  Inordar  (root,  word,  p); 
p-1. concreteness  t  =  intangible;  and; 
and; 

Reset  (Intngble,  ’Intngbla  taxt  a1); 

Whila  not  EOF  (InTngble)  do  bagin 
Raadln  (Intngbla,  word,  c); 

if  (  c  <>  'i')  than  bagin  Inordar  (root,  word,  p); 

P- . concreteness  t-  tangible;  and; 
and; 

Rasat  (Postgrd,  'Postgrd  taxt  a')) 

Whila  not  EOF  (Postgrd)  do  bagin 
Readln  (Postgrd,  word,  c); 

if  (  c  <>  'p')  than  bagin  Inordar  (root,  word,  p); 
if  (c  »  'a')  than  p- . difficulty  i=  grad; 

if  (o  'h')  than  p- . difficulty  t=  high_school; 

if  (c  a  'a')  than  p->. difficulty  i*  alamantary;  and; 

and; 

Rasat  (Graduate,  'Graduate  taxt  a'); 

Whila  not  EOF  (Graduate)  do  bagin 
Raadln  (Graduate,  word,  c); 

if  (  c  <>  'g')  than  bagin  Inordar  (root,  word,  p); 
if  (c  *  'p')  than  p- . difficulty  t  =  postgrad; 

if  (c  =  'h')  than  p- . difficulty  «*  high_school; 

if  (c  -  'a')  than  p- . difficulty  i=  alamantary;  and; 

and; 

Rasat  (HighSchl,  'HighSchl  taxt  a'); 

Whila  not  EOF  (HighSchl)  do  bagin 
Raadln  (HighSchl,  word,  c); 

if  (  c  <>  'h')  than  bagin  Inordar  (root,  word,  p); 
if  (c  a  'p')  than  p^ . difficulty  ia  postgrad; 

if  (e  =  'g')  than  P-. difficulty  i  =  grad; 

if  (o  *  'a')  than  p-. di fficulty  >a  alamantary;  and; 

and; 

Rasat  (Elements,  ' Elamanta  taxt  a'); 

Whila  not  EOF  (Elamanta)  do  bogin 
Raadln  (Elamanta,  word,  c); 

if  (  c  <>  'a')  than  bagin  Inordar  (root,  word,  p); 
if  (c  -  'p')  than  p- , difficulty  t=  postgrad; 

if  (c  =  'g')  than  p- . difficulty  grad; 

if  (c  =  'h')  than  p->.  difficulty  i*  high_school;  arid; 

and; 
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Reset  (Sublme,  * Sublme  text  s'); 

While  not  EOF  (Sublme)  do  begin 
Readln  (Sublme/  word/  c); 

if  (  c  <>  ’s')  then  begirt  Inorder  (root/  word/  p); 
if  (e  »  'p')  then  p-. emotion  !=  pleasant; 
if  (c  a  »n')  then  p-<.  emotion  t=  nuetral; 


-  nuetral; 


A  I  V  u  •  n  *  Ul«n  H  *  »»I»V  W*WII  I-  Iiwevi  m  a  f 

if  (c  =  ’u')  then  p-. emotion  t3  unpleasant; 

•  ai  »  _  IL  I  \  J.L  _  _  U _ _  —  1 _ J  J 


if  Cc  3  'h* ) 
end; 


.emotion  tB  horrid; 


Reset  (Pleasnt/  'Pleasnt  text  a1); 

While  not  EOF  (Pleasnt)  do  begin 
Readln  (Pleasnt/  word/  c); 

if  (  c  <>  'p')  then  begin  Inorder  (root/  word/  p); 
if  (c  =  ’s')  then  P-, emotion  t  =  sublime; 

if  (c  *  'n')  then  p- . emotion  i=  nuetral; 

if  (c  *  * u ' )  then  p-. emotion  i=  unpleasant; 

if  (c  =  'h')  then  P">. emotion  t=  horrid;  end; 

end; 


Reset  (Nuetrl/  'Nuetrl  text  a'); 

While  not  EOF  (Nuetrl)  do  begin 
Readln  (Nuetrl/  word/  c); 

if  (  c  <>  *n')  then  begin  Inorder  (root,  word,  p); 
if  (c  =  '*»)  then  p->. emotion  i=  sublime; 

if  (c  »  'p')  then  p->. emotion  t°  pleasant; 

if  (c  =  ’u')  then  p". emotion  i*  unpleasant; 

if  (c  a  'h')  then  p-. emotion  13  horrid;  end; 

end; 

Reset  (Unpleesn,  'Unpleasn  text  s'); 

While  not  EOF  (Unpleasn)  do  begin 
Readln  (Unpleasn,  word,  c); 

if  (  c  <>  'u')  then  begin  Inorder  (root,  word,  p); 
if  (c  *  's')  then  p-. amotion  i®  sublime; 

if  (o  3  *p')  then  P-. emotion  s  =  pleasant; 

if  (c  3  'n')  then  p-*. emotion  <a  nuetral; 

if  (c  8  'h')  than  p-1. emotion  >s  horrid;  end; 

end; 


Reset  (Horrd,  'Horrd  text  a'); 
While  not  EOF  (Horrd)  do  begin 
Raadln  (Horrd,  word,  c); 

■  /  _  j  v  I  I.  I  S  1L _ I.  .  _  T 


if  (c 
if  (c 
if  ( c 
if  (c 


vnorra*  wore;  cv* 

<>  * H * )  then  begin  Inorder  (root,  word,  p); 

I  _  I  \  .LI —  _  J.  J -  _ . .  L  I  J . 


•then  p-*.  emotion 
then  p-. emotion 
then  p-. emotion 
then  p^. emotion 


sublime; 

pleasant; 

nuatral; 

unpleasant;  end; 


Reset  (Violnt,  'Violnt  text  a'); 

While  not  EOF  (Violnt)  do  begin 
Readln  (Violnt,  word,  c); 

if  (  c  <>  'v')  then  begin  Inorder  (root,  word,  p); 
if  (c  8  'e')  than  p-. vigor  i=  energetic; 
if  (c  3  'o')  than  p-*. vigor  »3  calm; 

if  (c  3  * i * )  than  p-. vigor  i3  inert;  and; 

end; 

Reset  (Energeti,  'Energeti  text  a'); 

While  not  EOF  (Energeti)  do  begin 
Readln  (Energeti,  word,  c); 

if  (  c  <>  'a')  then  begin  Inorder  (root,  word,  p); 
if  (c  3  'v')  then  p-. vigor  i*  violent; 

if  (c  3  *  c ' )  then  p-. vigor  i"  calm; 

if  (c  =  * i ‘ )  then  p-. vigor  i=  inert;  end; 

end; 


msssm 


msssmm 


Reset  (Clm/  'Clm  text  a'); 

While  not  EOF  (Clm)  do  begin 
Reedln  (Clm/  word/  c) ; 
if  (  c  <>  »c»)  then  begin  Inorder  (root/ 
if  (o  *  'v')  then  p-. vigor  i=  violent; 

if  (c  “  's’)  then  p-. vigor  t=  eneraetio; 

if  (c  *  'i')  then  p-. visor  ts  inert;  end 

end; 

Reset  (Inrt/  •  Inrt  text  e'); 

While  not  EOF  (Inrt)  do  begin 
Racidln  (Inrt/  word/  o); 
if  (  c  <>  * i * )  then  begin  Inorder  (root/ 
if  (c  =  * v ' )  then  p-. vigor  i=  violent; 

if  (c  =  *e')  then  p-. vigor  i=  energetic; 

if  (c  =  'c')  than  p- . vigor  ia  calm;  end; 

end; 

Store„New_Dix; 


end. 


word/  p) 


word/  p) 


107 


APPENDIX  E 
REPRESENTATIVE  RUNS 


PROFILE 


Plato  a  Phaadrns 


Total  of  santancaa  i  17 
Total  of  words  i  560 
Total  of  lattars  >  1595 


360  words  ara  too  faw  for  valid  atatiatical  analysis 


Of  tha  360  words  in  Plato's  Phaadrus 

312  wara  natchad  to  words  in  tha  Stylist  dictionary. 

86  '/.  wara  matchad. 

This  is  enough  for  statistical  analysis. 


BREAKDOWN  OF  PERCENT  OF  LETTERS  PER  WORD 


30 

28 

26 

24 

22 


20 

18 

16 

14 

12 

10 

8 

6 

4 

2 


2 


X 

X 

X 

X 

X 

X 

X  X 

X  X 

X  X 

3  4 
Number 


X  X 

X  X 

X  X  X  X 

X  X  X  X 

X  X  X  X 

X  XXX 

X  X  X  X 

X  X  X  X 

5  6  7  8 

of  Letters  in  a  Word 


X 

X 

X 
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X 

X 

10  11 


The  average  numbar  of  letters  pap  word  i  6 . 131428571E+00 


12 


13  14 


15 


A  typical  nonfiction  texts  distribution 

rasamblas  a  low  bell-shaped  curve  centered  around  six  latters/word. 
The  length  of  the  words  is  medium 


BREAKDOWN  OF  NUMBER  OF  WORDS  PER  SENTENCE 


1 

Number  of  word* 
100 
92 
96 
94 
92 
90 
28 
26 
84 
82 
80 
78 
76 
74 
72 
70 
68 
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64 
62 
60 


54 

52 

50 

48 

46 

44 

42 

40  X 

38  X 

36  X 

34  X 

32  X  XX 

30  X  XX 

28  X  XX  X 

26  X  XX  X 

24  XXX  XX  X 

22  X  XXXX  XX  X 

20  X  XXXX  XX  X 

13  X  XXXXX  XXX  X 

16  XXXXXXXX  XXX  X 

14  XXXXXXXX  XXX  X 

12  XXXXXXXXXXXXXXX  X 
10  XXXXXXXXXXXXXXXXX 
8  XXXXXXXXXXXXXXXXX 
6  XXXXXXXXXXXXXXXXX 
4  XXXXXXXXXXXXXXXXX 
2  XXXXXXXXXXXXXXXXX 

10  20  30  40  50  60  70 

Santanca  Numbar 

The  avaraga  numbar  of  words  par  santanca  t  2. 117647059E+01 

A  typical  madam  taxis  santancas  avaraga  batwaan  fiftaan  and  twanty  words 
Santancas  ara  long. 


Numbar  of  run  ons  0 
Parcant  of  run  ons  0 
Run  ons  ara  nonaxistsnt, 


Number  of  medium  length  sentences  9 
Percent  of  medium  length  sentences  52 

Modulation  is  good. 

ETYMOLOGY  OF  WORDS 

Number  of  Letinete  words  i  51 

Number  of  Germanio  words  i  95 

Etymology  is  very  native. 


DIFFICULTY  OF  VOCABULARY 


PostGraduate  difficulty  i  0 
Graduate  difficulty  i  1 
High  School  difficulty  i  25 
Elementary  difficulty  <  120 


Percent  of  Postgraduate  difficulty  0 

Percent  of  Graduate  difficulty  0 

Percent  of  High  School  difficulty  8 


Difficulty  is  easy. 


TANGIBILITY 


Number  of  Tangible  words  i  35 

Number  of  Intangible  words  i  111 

Tangibility  is  tangibible. 


EMOTIONAL  CONNOTATIONS 


Sublime  connotations  ■  7 
Pleasant  connotations  i  44 
Neutral  connotations  i  87 
Unpleasant  connotations  ■  8 
Horrid  connotations  i  0 


Percent  of  sublime  connotations  2 
Peroant  of  pleasant  connotations  14 
Percent  of  unpleasant  connotations  2 
Percent  of  horrid  connotations  0 


Index  of  Emotionality 
Emotionality  is  Rich 
Tone,  is  Positive 
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VIGOR  OF  WORDS 


Words  of  Extrams  Visor  t  0 
Words  of  Much  Visor  t  17 
Words  of  Soms  Visor  i  75 
Words  of  Littla  Visor  t  54 


Parcont  of  words  of  axtrama  vigor  i  0 

Parcant  of  words  of  much  visor  i  5 

Parcant  of  words  of  soma  vigor  i  24 

Indax  of  Visor  49 

Visor  is  livaly 

Congratulations!  You  navar  saam  to  writs  run  on  santanoss. 

This  is  a  solid  piaoa  of  writing  wall  within  ths  traditions  of  its  ganra. 


Phaedrus 


CONCORDANCE 


t 

I* 


them  and  thamus  enquired  ABOUT  thain  savaral  usas  and 

discovarad  is  not  an  AID  to  mamory  but  to 

othor  agyptians  might  ba  ALLOWED  to  have  tha  banafit 

an  art  is  not  ALWAYS  tha  bast  judga  of 

is  cailad  by  tham  AMMON  to  him  cams  thauth 

learnad  nothing  thay  will  APPEAR  to  ba  omnisciant  and 

cansurad  othars  as  ha  APPROVED  or  disapprovad  of  tham 

many  arts  such  as  ARITHMETIC  and  calculation  and  geomatry 

or  invantor  of  an  ART  is  not  always  tha 

tha  invantor  of  many  ARTS  such  as  arithmatic  and 

blame  of  tha  various  ARTS  but  whan  thay  came 

calculation  and  gaomatry  and  ASTRONOMY  and  draughts  and  dice 

have  boon  lad  to  ATTRIBUTE  to  tham  a  quality 

in  tha  laarnars'  souls  BECAUSE  thay  will  not  use 

your  own  children  have  BEEN  lad  to  attribute  to 

allowed  to  have  tha  BENEFIT  of  tham  ha  anumaratad 

is  not  always  ths  BEST  judge  of  the  utility 

wiser  and  give  tham  BETTER  memories  it  is  a 

name  was  theuth  tha  BIRD  which  is  called  the 

theuth  in  praise  or  BLAME  of  the  various  arts 

it  is  a  specific  BOTH  for  the  memory  and 

such  as  arithmatic  and  CALCULATION  and  gaomatry  and  astronomy 

agypt  which  tha  hellenos  CALL  agyptian  thabos  and  tha 

the  bird  which  is  CALLED  the  ibis  is  sacred 

the  god  himself  is  CALLED  by  them  ammon  to 

them  ammon  to  him  CAME  thauth  and  showed  his 

arts  but  whan  thay  CAME  to  letters  this  said 

a  quality  which  thay  CANNOT  have  for  this  discovery 

soma  of  tham  and  CENSURED  othars  as  ha  approved 

to  tha  external  written  CHARACTERS  and  not  remember  of 

leva  of  your  own  CHILDREN  have  baan  lad  to 

socrates  at  tha  agyptian  CITY  of  naucratis  thara  was 

dwelt  in  that  great  CITY  of  upper  egypt  which 

thay  will  ba  tiresome  COMPANY  having  the  show  of 

king  of  the  whole  COUNTRY  of  egypt  and  he 

discovery  of  yours  will  CREATE  forgetfulness  in  the  learners' 

letters  now  is  those  DAYS  the  god  thamus  was 

and  showed  his  inventions  DESIRING  that  tha  other  agyptians 

astronomy  and  draughts  and  DICE  but  his  great  discovery 

as  ha  approved  or  DISAPPROVED  of  tham  it  would 

and  you  give  your  DISCIPLES  not  truth  but  only 

specific  which  you  have  DISCOVERED  is  not  an  aid 

dice  but  his  great  DISCOVERY  was  tha  use  of 

cannot  have  for  this  DISCOVERY  of  yours  will  craata 

gaomatry  and  astronomy  and  DRAUGHTS  and  dice  but  his 

of  agypt  and  ha  DWELT  in  that  great  city 

tha  whole  country  of  EGYPT  and  ha  dwelt  in 

great  city  of  upper  EGYPT  which  the  hellenes  cell 

socrates  at  tha  EGYPTIAN  city  of  naucratis  thara 

which  tha  hellanea  call  EGYPTIAN  thabes  and  tha  god 

desiring  that  tha  other  EGYPTIANS  might  ba  allowed  to 

thauth  will  make  tha  EGYPTIANS  wiser  and  give  tham 

enumerated  tham  and  thamus  ENQUIRED  about  thair  savaral  usas 

banafit  of  tham  he  ENUMERATED  them  and  thamus  enquired 

will  trust  to  the  EXTERNAL  written  characters  and  not 

naucratis  there  was  a  FAMOUS  old  god  whose  name 

you  who  are  the  FATHER  of  letters  from  tha 

of  yours  will  create  FORGETFULNESS  in  tha  learners'  souls 

ba  omnisciant  and  will  GENERALLY  know  nothing  they  will 

arithmetic  and  calculation  and  GEOMETRY  and  astronomy  and  draughts 

the  agyptians  wiser  end  GIVE  them  batter  memories  it 

to  reminiscence  and  you  GIVE  your  diuciples  not  truth 
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waa  a  famous  old  GOD  whosa  nama  was  thauth 

is  those  days  the  GOO  thamus  was  the  kino 

agyptian  thebea  and  tha  GOD  himsalf  is  cal lad  by 

and  dice  but  his  GREAT  discovary  was  tha  usa 

ha  dwalt  in  that*  GREAT  city  of  uppar  egypt 

will  ba  tirasoma  company  HAVING  tha  show  of  wisdom 

truth  thay  will  ba  HEARERS  of  many  things  and 

uppar  agypt  which  tha  HELLENES  call  agyptian  thabas  and 

thabas  and  tha  god  HIMSELF  is  callad  by  than 

which  is  callad  tha  IBIS  is  sacrad  to  him 

thamus  rapliad  o  most  INGENIOUS  thauth  tha  parant  or 

thsm  and  in  this  INSTANCE  you  who  ara  tha 

of  tha  utility  or  INUTILITY  of  his  own  invantions 

thauth  and  showed  his  INVENTIONS  desiring  that  tha  other 

inutility  of  his  own  INVENTIONS  to  tha  usars  of 

and  h*  was  tha  INVENTOR  of  many  arts  such 

thauth  tha  parant  or  INVENTOR  of  an  art  is 

not  always  tha  bast  JUDGE  of  tha  utility  or 

god  thamus  was  tha  KING  of  tha  whole  country 

omniscient  and  will  ganarally  KNOW  nothing  thay  will  ba 

things  and  will  have  LEARNED  nothing  they  will  appear 

create  forgetfulness  in  tha  LEARNERS'  souls  because  thay  will 

own  children  have  bean  LED  to  attributa  to  them 

was  tha  usa  of  LETTERS  now  is  those  days 

whan  thay  came  to  LETTERS  this  said  thauth  will 

are  tha  father  of  LETTERS  from  tha  paternal  love 

it  would  taka  a  LONG  time  to  repast  all 

lsttars  from  tha  paternal  LOVE  of  your  own  children 

this  said  thauth  will  MAKE  tha  egyptians  wiser  and 

was  tha  inventor  of  MANY  arts  such  as  arithmetic 

will  ba  hearers  of  MANY  things  and  will  have 

and  give  them  batter  MEMORIES  it  is  a  specific 

will  not  usa  thair  MEMORIES  thay  will  trust  to 

specific  both  for  tha  MEMORY  and  for  tha  wit 

not  an  aid  to  MEMORY  but  to  raminisoanca  and 

that  tha  other  egyptians  MIGHT  bs  allowed  to  have 

wit  thamus  rapliad  o  MOST  ingenious  thauth  tha  parant 

famgus  old  god  whosa  NAME  was  thauth  tha  bird 

tha  agyptian  city  of  NAUCRATIS  there  was  a  famous 

and  will  have  learned  NOTHING  thay  will  appear  to 

and  will  generally  know  NOTHING  thay  will  ba  tirasoma 

tha  wit  thamus  replied  0  most  ingenious  thauth  the 

thara  was  a  famous  OLD  god  whosa  name  Was 

will  appear  to  ba  OMNISCIENT  and  will  ganarally  know 

invantions  dasiring  that  tha  OTHER  egyptians  might  bs  allowed 

of  them  and  censured  OTHERS  as  ha  approved  or 

or  inutility  of  his  OWN  invantions  to  tha  usars 

paternal  love  of  your  OWN  children  have  been  lad 

most  ingenious  thauth  tha  PARENT  or  inventor  of  an 

of  letters  from  tha  PATERNAL  love  of  your  own 

said  to  theuth  in  PRAISE  or  blame  of  the 

thair  several  uses  and  PRAISED  soma  of  them  and 

attributa  to  tham  a  QUALITY  which  thay  cannot  have 

of  wisdom  without  tha  REALITY 

written  characters  and  not  REMEMBER  of  themselves  tha  specific 
to  memory  but  to  REMINISCENCE  and  you  give  your 
a  long  time  to  REPEAT  all  that  thamus  said 
for  the  wit  thamus  REPLIED  o  most  ingenious  theuth 

callad  the  ibis  is  SACRED  to  him  and  he 

repeat  all  that  thamus  SAID  to  theuth  in  praise 
came  to  letters  this  SAID  theuth  will  make  the 
truth  but  only  tha  SEMBLANCE  of  truth  thay  will 
thamus  enquired  about  thair  SEVERAL  uses  and  praised  soma 
tiresome  company  having  the  SHOW  of  wisdom  without  the 
him  came  theuth  and  SHOWED  his  invantions  desiring  that 
SOCRATES  at  the  agyptian  city 


several  usas  and  praised  SOME  of  tham  and  oansurad 

forgetfulness  in  the  learners'  SOULS  baoauaa  thay  will  not 

memories  it  is  a  SPECIFIC  both  for  tha  memory 

remember  of  thamaalvaa  tha  SPECIFIC  which  you  hava  diaoovarad 

of  tham  it  would  TAKE  a  long  tima  to 

thosa  days  tha  god  THAMUS  was  tha  king  of 

ha  enumerated  tham  and  THAMUS  enquired  about  thair  savaral 

to  rapaat  all  that  THAMUS  said  to  thauth  in 

and  for  tha  wit  THAMUS  rapliad  o  most  ingenious 

tha  hallanas  call  agyptian  THEBES  and  tha  god  himself 

and  thamus  anquirad  about  THEIR  savaral  uses  and  praisad 

thay  will  not  usa  THEIR  mamoriaa  thay  will  truat 

and  not  ramembar  of  THEMSELVES  tha  spaoifio  which  you 

agyotian  city  of  naucratis  THERE  was  a  famous  old 

god  whosa  nama  was  THEUTH  tha  bird  which  is 

ammon  to  him  cama  THEUTH  and  showad  his  invantions 

that  thamus  said  to  THEUTH  in  praisa  or  blama 

to  letters  this  said  THEUTH  will  make  tha  egyptiana 

rapliad  o  most  ingenious  THEUTH  tha  parent  or  inventor 

be  haarars  of  many  THINGS  and  will  hava  learned 

of  letters  now  is  THOSE  dsys  tha  god  thamus 

would  taka  a  long  TIME  to  rapaat  all  that 

nothing  thay  will  ba  TIRESOME  company  having  tha  ahow 

thair  mamoriaa  thay  will  TRUST  to  tha  external  written 

give  your  diaciples  not  TRUTH  but  only  tha  semblance 

only  tha  semblance  of  TRUTH  thay  will  ba  haarars 

that  great  city  of  UPPER  egypt  which  tha  hallanas 

great  discovery  was  tha  USE  of  letters  now  is 

bacausa  thay  will  not  USE  thair  mamoriaa  thay  will 

own  invantions  to  tha  USERS  of  tham  and  in 

anquirad  about  thair  several  USES  and  praiaad  soma  of 

bast  judge  of  tha  UTILITY  or  inutility  of  his 

or  blame  of  tha  VARIOUS  arts  but  whan  they 

tha  various  arts  but  WHEN  thsy  cama  to  letters 

was  thauth  ths  bird  WHICH  is  called  tha  ibis 

city  of  upper  egypt  WHICH  tha  hallanas  osll  egyptisn 

to  tham  a  quality  WHICH  thay  cannot  have  for 

of  themselves  tha  specific  WHICH  you  hava  discovered  is 

tha  king  of  tha  WHOLE  country  of  egypt  and 

a  famous  old  god  WHOSE  nams  was  thauth  tha 


having  tha  show  of  WISDOM  without  ths  reality 

will  make  tha  agyptians  WISER  and  give  tham  batter 

memory  and  for  tha  WIT  thamus  replied  o  most 

tha  show  of  wisdom  WITHOUT  tha  reality 

disapproved  of  tham  it  WOULD  taka  a  long  tima 

trust  to  tha  external  WRITTEN  oharaotara  and  not  remember 

for  this  discovery  of  YOURS  will  create  forgetfulness  in 


Student  Paper.  Example  of  Poor  Writing 

PROFILE 


Total  of  sentences  i  62 
Total  of  words  i  2026 
Total  of  letters  t  10561 


2026  words  are  plenty  for  valid  statistical  analysis. 


Of  the  2026  words  in  Student  Example  of  Poor  Writing  > 

1639  were  matched  to  words  in  if-,*  Ttyliat  dictionary. 

90  %  wars  matched. 

This  is  enough  for  statistical  analysis. 
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BREAKDOWN  OP  PERCENT  OF  LETTERS  PER  WORD 
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Lattars  in  a  Word 
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Tha  avaraga  numbar  of  lattara  par  word  i  6 .921887713E+0Q 
A  typical  nonfiction  taxts  distribution 

raaemblas  a  low  ball-shapad  ourva  cantarad  around  six  lattara/word. 
Tha  langth  of  tha  words  is  long. 
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10  20  30  40  50  60  70 

Santanco  Numbar 

Tho  avaraga  numbar  of  worda  par  aantanca  i  3 , 267741935E+Q1 


A  typical  modarn  tsxts  aantanca*  avaraga  batwaan  fiftaan  and  twsnty  word*. 

Santanca*  ara  too  long. 

Numbar  of  run  on*  13 
Parcant  of  run  ona  20 

Run  on*  ara  unaecaptabla 
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Number  of  medium  length  iml.ncu  14 
Percent  of  medium  length  sentences  22 

Modulation  is  average 


ETYMOLOGY  OF  WORDS 


Number  of  Latinata  word* 
Numbar  of  Germanic  word* 

Etymology  is  mixed 


DIFFICULTY  OF  VOCABULARY 


Postgraduate  difficulty  . 

Graduate  difficulty  , 

High  School  difficulty  , 

Elementary  difficulty  i 

Percent  of  Postgraduate  difficulty 
Percent  of  Graduate  difficulty 
Percent  of  High  School  difficulty 


Difficulty  ia  very  hard 


Number  of  Tangible  words  i 
Number  of  Intangible  words  ■ 

Tangibility  is  tangibible. 


TANGIBILITY 


EMOTIONAL  CONNOTATIONS 


Sublime  connotations 
Pleasant  connotations 
Neutral  connotations 
Unpleasant  connotations 
Horrid  connotations 


Percent  of  sublime  connotations 
Percent  of  pleasant  connotations 
Percent  of  unpleasant  connotations 
Percent  of  horrid  connotations 

Index  of  Emotionality  i 

Emotionality  is  Rich 

Tone  is  Positive 


WlVL'V' 


V 

s. 

*> 

A  X. 


VIGOR  OF  WORDS 


Words  of  Extreme  Vigor  i  1 
Words  of  Much  Vigor  ■  94 
Words  of  Some  Vigor  i  464 
Words  of  Little  Vigor  i  551 


Percent  of  words  of  extreme  vigor  t  0 

Percent  of  words  of  much  vigor  i  5 

Percent  of  words  of  soma  vigor  i  25 

Index  of  Vigor  50 

Vigor  is  lively 


RECOMMENDATION  NUMBER  1 


Yog  tend  to  write  run-on  santancas. 

Chaok  your  longest  santancas  for  run  ons. 
Break  them  up  into  units  of  single  ideas. 


RECOMMENDATION  NUMBER  2 


Your  average  santancas  are  too  long  for  the  difficulty  of  your  vocabulary. 
Use  simpler  words  or  shorter  santancas. 


Excerpt  from  A  Farewell  to  Arms 


Total  of  santanoaa  i 
Total  of  word*  i 
Total  of  letters  t 


1349  words  iri 


PROFILE 


65 

1349 

5431 


planty  for  valid  statistical  analysis. 


Of  tha  1349  words  in  Excarpt  from  A  Farawall  to  Arms 
1197  wars  matched  to  words  in  the  Stylist  dictionary. 


88  V,  war#  matched. 

This  is  enough  for  statistical  analysis. 
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BREAKDOWN  OF  PERCENT  OF  LETTERS  PER  WORD 
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22 

20 

18 

16 

14 

12 

10 

8 

6 

4 

2 


X 

2 


X 

X 

X 

X 

X 


X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 


X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 


X 

X 

X 

X 

X 

X 

X 


X 

X 

X 

X 


X 

X 

X 


3  4  5  6  7  8 

Number  of  Lattars  in  a  Word 


X 

9 


10 


11  12  13  14  15 


Tha  average  number  of  letters  par  word 


5.100817439E+00 


A  typical  fiction  texts  distribution 
raaamblas  a  tall  ball-shaped  curvo  cantarad  around  fiva  1 attars/ word. 


Tha  langth  of  the  words  is  medium 
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1 

Number 

100 

98 

96 

94 

92 

90 

88 

86 

84 

82 

80 

78 

76 

74 

72 

70 

68 

66 

64 

62 

60 

58 

56 

54 

52 

50 

48 

46 

44 

42 

40 

38 

36 

34 


of  words 


BREAKDOWN  OF  NUMBER  OF  WORDS  PER  SENTENCE 


X 

X 

X 

X 

X 

X 

X 

X 

X 


X 

X 

X 

X 

X 

X 


X 

X 

X 


X 

X 

X 

X 

X 

X 

X 

X 


X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 


X 

X 


32 

X 

XX  X  X 

X 

X 

30 

X 

XX  X  X 

X 

XX  X 

28 

X  X 

XX  X  XX 

X 

X 

X 

X 

XX  X  X 

26 

XX  X 

XXXX  XX 

X 

X 

X 

X 

X 

XX  X  X 

24 

XX  X 

XXXX  XX 

XX 

X 

X 

X 

X 

XX 

XX  XX  X 

22 

XX  X 

X  XXXX  XX 

XX 

X 

X 

X 

X 

X 

XX 

X 

XX  XX  XXX 

20 

XXXX 

XXXXXX  XX 

XX 

X 

X 

X 

X 

X 

X 

XX 

X 

X 

XX  XX  XXX 

18 

XXXX 

xxxxxxxxx 

XXX 

X 

X 

X 

X 

X 

X 

X 

XX 

X 

X 

XX  XXXXXX 

X 

16 

XXXX 

xxxxxxxxxxx 

XXXXXX 

X 

X 

X 

X 

X 

X 

X 

XX 

X 

XXXXXXXXXXXX 

X 

14 

XXXX 

xxxxxxxxxxx 

XXXXXX 

X 

X 

XX 

X 

X 

X 

X 

XXX 

X 

XXXXXXXXXXXX 

X 

12 

XXXX 

xxxxxxxxxxx 

XXXXXX 

XXXX 

XX 

X 

X 

X 

XXX 

XXX 

X 

XXXXXXXXXXXX 

X 

10 

XXXX 

XXXXXXXXXXXXXXXXXX 

XXXX 

XX 

XXX 

X 

XXXXX 

XXX 

xxxxxxxxxxxxxx 

X 

8  XXXX  XXXXXXXXXXXXXXXXXX  XXXXXXX  XXXXXXXXXXXX  XXX  XXXXXXXXXXXXXXXX 
6  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  xxxxxxxxxxxxxxxxxxxx 
4  xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
2  xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

10  20  30  40  50  60  70 

Sentence  Number 


The  average  number  of  words  per  sentence 


2.075384615E+01 


A  typical  modern  texts  sentences  average  between  fifteen  and  twenty  words. 

Sentences  are  long. 

Number  of  run  ons  3 

Percent  of  run  ons  4 


Run  ons  are  acceptable. 


Number  of  medium  length  sentences  26 
Percent  of  medium  length  santancas  40 

Modulation  is  ayaraga 


ETYMOLOGY  OF  WORDS 

Number  of  Latinata  word*  i  1S7 

Number  of  Germanic  words  i  484 

Etymology  is  vary  native. 


DIFFICULTY  OF  VOCABULARY 


PoatGraduata  difficulty 
Graduate  difficulty 
High  School  difficulty 
Eiamantary  difficulty 


0 

0 

25 

596 


Parcant  of  Postgraduate  difficulty 
Parcant  of  Graduate  difficulty 
Percent  of  High  School  difficulty 


0 

0 

2 


Difficulty  is  easy, 


TANGIBILITY 

Number  of  Tangible  worda  i  179 

Numbar  of  Intangible  words  i  442 

Tangibility  is  tangibible. 


EMOTIONAL  CONNOTATIONS 


Sublime  connotations  <  1 
Pleasant  connotations  t  135 
Neutral  connotations  <  450 
unpleasant  oonnotations  1  34 
Horrid  connotations  1  1 


Percent  of  sublime  connotations 
Percent  of  pleasant  connotations 
Percent  of  unpleasant  connotations 
Parcant  of  horrid  connotations 

Index  of  Emotionality  26 

Emotionality  is  Rich 

Tone  is  Positive 


0 

11 

2 

0 


VIGOR  OF  WORDS 


Words  of  Extreme  Vigor  *  4 
Words  of  Much  Vigor  t  73 
Words  of  Soma  Vigor  i  324 
Words  of  Little  Vigor  i  220 


Percent  of  words  of  extrema  vigor  i  0 

Parcaivt  of  words  of  much  vigor  t  6 

Parcant  of  words  of  soma  vigor  i  27 

Index  of  Vigor  57 

Vigor  is  strong 


This  is  a  solid  piece  of  writing  wall  within  tha  traditions  of  its  ganra. 

You  are  as  abla  to  undarstand  tha  maaning  of  tha  abova  characteristics  as  Tha  St 
ylist. 

Stand  tha  couraa! 
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Computer  Scienc*  Text  1 


PROFILE 


Total  of  sentences  <  56 
Total  of  word*  i  1176 
Total  of  lattara  t  5668 


1178  words  ara  planty  for  valid  statistical  analysis. 


Of  tha  1178  words  in  Computer  Science  Text  1 

1026  ware  matched  to  words  in  tha  Stylist  dictionary. 

87  ware  matched, 

This  is  enough  for  statistical  analysis. 


BREAKDOWN  OF  PERCENT  OF  LETTERS  PER  WORD 

JO 

28 

26 

24 

22 

20 


IS 

t 

16 

. 

X 

14 

X 

X 

X 

12 

X 

X 

X 

4 

X 

10 

X 

X 

X 

X 

X 

X 

8 

X 

X 

X 

X 

X 

X 

6 

1 

X 

X 

X 

X 

X 

X 

4 

X 

X 

X 

X 

X 

X 

X 

. 

2 

.  X 

X 

X 

X 

X 

X 

X 

X 

1 

2  3 

4 

5 

6 

7 

8 

9 

10 

Number 

of  Letters 

in  a 

Word 

The  average  number  of  letters  per  word  t  6 . 500000000E+00 


A  typical  nonfiction  texts  distribution 

resembles  a  low  bell-shaped  curve  centered  around  six  letters/word. 
The  length  of  the  words  is  medium 


15 
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1  BREAKDOWN  OF  NUMBER  OF  WORDS  PER  SENTENCE 

Number  of  words 
100 
90 
96 
94 
92 
90 
88 
86 
84 
82 
80 
78 
76 
74 
72 
70 
68 
66 
64 
62 
60 
58 
56 
54 
52 
50 
48 


44 

X 

X 

42 

X 

X 

X 

40 

X 

X 

X 

38 

X 

X 

X 

X 

36 

X 

X 

X 

X 

X 

34 

X 

X 

XX 

X 

X 

32 

X 

x 

X 

XX 

XX 

X 

X 

30 

X 

X 

X 

XX 

XX 

X 

X 

X 

28 

X 

X 

X 

X 

XX 

XX 

X 

X 

X 

XXX 

26 

X 

X 

X 

X 

X 

XX 

XXX 

X 

X 

X 

X 

X 

XXX 

24 

X 

X 

X 

X 

X 

X 

XX 

XXX 

X 

XX 

X 

X 

X 

XXX 

22 

X 

X 

X 

X 

X 

X 

XX 

XXX 

X 

XX 

X 

X 

X 

XXX 

20 

XX 

X 

X 

XXX 

X 

XX 

XX 

xxxx 

X 

XX 

X 

X 

X 

XXX 

18 

X  XX 

X 

X 

XX 

XXXXX 

XX 

XX 

xxxx 

X 

XX 

X 

X 

X 

XXX 

16 

xxxxxxx 

X 

X 

XXX 

xxxxx 

XXXX 

XXX 

xxxxxxxxxxx 

X 

X 

X 

XXX 

14 

xxxxxxx 

X 

X 

xxxxxxxxx 

xxxx 

XXX 

xxxxxxxxxxx  X 

X 

XXX 

XXX 

12 

xxxxxxx 

XXX 

xxxxxxxxx 

xxxxxxxxxxxxxxxxxxxxxx 

X 

XXX 

XXX 

10 

xxxxxxx 

XXX 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

XXXXXXX 

8  XXXXXXX  XXX  xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx  xxxxxxx 
6  XXXXXXX  XXX  xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
4  xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
2  xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

10  20  30  40  50  60  70 

Santanca  Numbar 

Tha  avaraya  numbar  of  words  par  santanca  i  2 . 103571428E+01 

A  typical  modarn  taxts  santanc as  avaraya  batwaan  fiftaan  and  twanty  words. 

Santancas  ara  long. 

Numbar  of  run  ons  1 

Parcant  of  run  ons  1 


Run  ons  ara  accaptabla. 


Number  of  medium  length  sentences  28 
Percent  of  medium  length  sentences  SO 

Modulation  is  average 


ETYMOLOGY  OF  WORDS 


Number  of  Latinate  words  • 

422 

Number  of  Germanic  words  > 

227 

Etymology  is  mixed 

DIFFICULTY 

OF  VOCABULARY 

PostGraduate  difficulty  ■ 

1 

Graduate  difficulty  t 

55 

High  School  difficulty  i 

275 

Elementary  difficulty  > 

320 

Percent  of  Postgraduate  difficulty  0 
Percent  of  Graduate  difficulty  5 
Percent  of  High  School  difficulty  26 


Difficulty  is  challenging 


TANGIBILITY 


Number  of  Tangible  words  t  106 

Number  of  Intangible  words  i  543 

Tangibility  is  vary  intangibibla. 


EMOTIONAL  CONNOTATIONS 


Sublime  connotations  t  1 
Pleasant  connotations  ■  50 
Neutral  connotations  i  562 
Unpleasant  connotations  i  32 
Horrid  connotations  i  4 


Percent  of  sublime  connotetions  0 
Percent  of  pleasant  connotations  4 
Percent  of  unpleasant  connotations  3 
Percent  of  horrid  connotations  0 

Index  of  Emotionality  14 


Emotionality  is  average 
Tone  is  Positive 


VIGOR  OF  WORDS 


Word*  of  Extram*  Vigor  i  7 
Word*  of  Much  Vigor  <  34 
Word*  of  Som*  Vigor  i  301 
Words  of  Littl*  Vigor  t  307 


P*rc*nt  of  words  of  axtraw*  vigor  i  0 

P«ro*nt  of  words  of  much  vigor  i  3 

Paroant  of  words  of  soma  vigor  i  29 

Indax  of  Vigor  44 

Vigor  is  iivaiy 


This  is  •  solid  pi«c*  of  writing  w«il  within  th*  tradition*  of  its  ganra. 

You  ara  as  abla  to  undarstand  tha  maaning  of  th«  aboyj  charactaristics  as  Th*  St 
✓list . 

Stand  tha  coursa! 


Excarpt  front  Galapagos  by  Vonnagut 

PROFILE 


Total  of  aantancas  i  22 
Total  of  words  i  542 
Total  of  lattars  i  2526 


542  words  ara  anough  for  valid  statistical  analysi 


Of  tha  542  words  in  Exoarpt  from  Galapagos  by  Vonnagut 
449  war#  matchad  to  words  in  tha  Stylist  dictionary. 


82  ’/.  wara  matched. 

This  is  anough  for  statistical  analysis. 


BREAKDOWN  OF  PERCENT  OF  LETTERS  PER  WORD 


3  4  5  6  7  8 

Number  of  Lattara  in  a  Word 


10  11  12  13  14  15 


Tha  avarags  numbar  of  lattara  par  word  i  5.760233918E+00 

A  typical  fiction  taxts  distribution 
raaamblaa  a  tall  ball-ahapad  curva  cantarad  around  fiva  lattara/word. 

Tha  langth  of  tha  words  ia  long. 


Number'  of  medium  length  sentences  10 
Percent  of  medium  length  sentences  45 


Modulation  is  av.ersge 


ETYMOLOGY  OF  WORDS 


Number  of  Latinete  words 
Number  of  Germanic  words 

Etymology  is  very  native. 


86 

184 


DIFFICULTY  OF  VOCABULARY 


PostGraduete  difficulty 
Graduate  difficulty 
High  School  difficulty 
Elementary  difficulty 


Percent  of  Postgraduate  difficulty 
Percent  of  Graduate  difficulty 
Percent  of  High  School  difficulty 


0 

2 

49 

218 


0 

0 

10 


Difficulty  is  easy. 


Number  of  Tangible  words 
Number  of  Intangible  words 

Tangibility  is  tangibible. 


TANGIBILITY 


76 

194 


EMOTIONAL  CONNOTATIONS 


Sublime  connotations  i  2 
Pleasant  connotations  i  79 
Neutral  connotations  <  170 
Unpleasant  connotations  ■  13 
Horrid  connotations  i  6 


Percent  of  sublime  connotations 
Percent  of  pleasant  connotations 
Percent  of  unpleasant  connotations 
Percent  of  horrid  connotations 

Index  of  Emotionality  43 

Emotionality  is  Rich 

Tons  is  Positive 


0 

17 

2 

1 


VIGOR  OF  WORDS 


Words  of  Extreme  Visor  i 
Words  of  Much  Vigor  t 
Words  of  Some  Vigor  i 
Words  of  Little  Vigor  < 


3 

39 

123 

105 


Percent  of  words  of  extreme  vigor  t 
Peroent  of  words  of  much  vigor  t 

Parcent  of  words  of  soms  vigor  i 

Index  of  Vigor  67 

Vigor  is  vary  strong 


0 

8 

27 


RECOMMENDATION  NUMBER  1 


You  tend  to  write  run-on  sentences. 

Check  your  longest  sentencss  for  run  ons. 
Break  tham  up  into  units  of  single  ideas. 
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